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1.  INTRODUCTION 

Let  X  denote  a  positive  random  variable  whose  underlying  distribution 
function  F  is  continuous  and  strictly  increasing  on  [0,a>].  For  x>0  let 
S(x)  be  the  tail  (survival)  probability  defined  by  S(x)  =  P(X>x)  =  l-F(x). 
For  0<p<l  let  xp  denote  the  (upper)  p^  quantile  of  X  defined  by 

S(xp)  =  1-F(xp)  =  P(X>Xp)  =  p  . 

In  this  report  methods  are  studied  for  estimating  tail  probabilities 
and  quantiles  based  on  a  random  sample  of  size  n  from  the  distribution  of  X. 
Interest  will  center  around  problems  of  estimating  S(x)  and  xp  when  n  is 
large  and  S(x)  and  p  are  both  quite  small.  Good  solutions  to  these  estima¬ 
tion  problems  are  particularly  useful  in  connection  with  extreme  value 
statistics.  For  the  probability  that  the  maximum  of  a  random  sample  of  size 
k  from  the  distribution  of  X  exceeds  x  is  equal  to  1  -  (l-S(x))  ;  and  the 
qth  quantile  of  this  sample  maximum  is  equal  to  x  ,  where  p  =  1  -  (1-q)^. 
Observe  that  if  k  is  large,  then  p  is  quite  small  -  in  fact,  approximately 
equal  to  k-1  1 og( 1/(1 -q ) ) . 

Global  parametric  estimators  typically  do  not  provide  good  solutions 
to  these  estimation  problems.  For  they  are  nonrubust  to  departures  from 
the  assumed  model  in  the  tails  of  the  distribution  of  X,  which  departures 
are  present  in  most  applications.  Standard  nonparametric  procedures,  based 
directly  on  empirical  tail  probabilities  suffer  from  lack  of  sufficient 
data  beyond  the  point  xp  of  interest  especially  if,  say,  p=l/n.  In  this 


3 


1.  INTRODUCTION 

Let  X  denote  a  positive  random  variable  whose  underlying  distribution 
function  F  is  continuous  and  strictly  increasing  on  [0,oo].  For  x>0  let 
S(x)  be  the  tail  (survival)  probability  defined  by  S(x)  =  P(X>x)  =  l-F(x). 
For  0<p<l  let  xp  denote  the  (upper)  p^  quantile  of  X  defined  by 

S(xp)  =  l-F(Xp)  =  P(X>xp)  =  p  . 

In  this  report  methods  are  studied  for  estimating  tail  probabilities 
and  quantiles  based  on  a  random  sample  of  size  n  from  the  distribution  of  X. 
Interest  will  center  around  problems  of  estimating  S(x)  and  xp  when  n  is 
large  and  S(x)  and  p  are  both  quite  small.  Good  solutions  to  these  estima¬ 
tion  problems  are  particularly  useful  in  connection  with  extreme  value 
statistics.  For  the  probability  that  the  maximum  of  a  random  sample  of  size 
k  from  the  distribution  of  X  exceeds  x  is  equal  to  1  -  (l-S(x))  ;  and  the 
qth  quantile  of  this  sample  maximum  is  equal  to  xp,  where  p  =  1  -  (l-q)1^. 
Observe  that  if  k  is  large,  then  p  is  quite  small  -  in  fact,  approximately 
equal  to  k-1  1 og( 1/(1 -q ) ) . 

Global  parametric  estimators  typically  do  not  provide  good  solutions 
to  these  estimation  problems.  For  they  are  nonrubust  to  departures  from 
the  assumed  model  in  the  tails  of  the  distribution  of  X,  which  departures 
are  present  in  most  applications.  Standard  nonparametric  procedures,  based 
directly  on  empirical  tail  probabilities  suffer  from  lack  of  sufficient 
data  beyond  the  point  xp  of  interest  especially  if,  say,  p=l/n.  In  this 


PRECEDING  PAGE  NOT  FILMED 
BLANK 


4 


report  estimators  will  be  proposed  and  studied  which  are  a  compromise 
between  completely  parametric  and  completely  nonparametric  procedures. 

The  estimation  problems  discussed  here  pertain  to  the  right  tail 
of  the  distribution  of  X.  But  the  same  methods  are  applicable  to  the  left 
tail,  which  is  also  important  in  applications.  In  particular  if  X  is  a 
positive  random  variable,  its  left  tail  can  be  estimated  by  applying  the 
procedures  described  here  to  the  right  tail  of  1/X.  Our  interest  in  pro¬ 
blems  of  estimating  tail  probabilities  and  extreme  value  distributions 
arose  initially  in  environmental  applications  (see  Breiman,  Stone  and 
Gins  [3]),  where  they  are  connected  with  violation  of  standards.  But 
the  same  problems  also  occur  in  reliability,  strength  of  materials, 
fatique  and  many  other  applications.  (See  Section  6.3  of  Gumbel  [6]  and 
Section  3.12  of  Galambos  [5]). 

The  specific  research  objectives  of  this  work  are  as  follows: 

a.  Define  exponential  tail  and  transformed  exponential  tail 
procedures  for  estimating  probabilities  in  the  upper  tail 
of  an  underlying  distribution  and  for  estimating  the 
corresponding  quantiles. 

b.  Study  analytically  the  bias  and  variance  of  the  estimators 
and,  for  the  transformed  exponential  tail  procedure,  verify 
the  existence  and  uniqueness  of  the  estimators. 

c.  Design  and  run  a  Monte  Carlo  experiment  to  supplement  the 
analytic  study  of  the  bias  and  variance  of  the  various 
estimators  for  a  variety  of  sample  sizes,  underlying  distri¬ 
butions  and  parameters  in  the  procedures. 

d.  Based  on  the  analytic  and  Monte  Carlo  results  draw  appropriate 
conclusions  and  make  suggestions  for  further  research. 
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2.  GENERAL  DISCUSSION 

In  Section  2.1  the  main  estimators  considered  in  this  report  are 
defined  and  some  of  their  basic  properties  are  surranarized.  In  Section  2.2 
some  alternative  estimators  are  noted.  A  Monte  Carlo  experiment  has  been 
designed  to  compare  all  these  estimators  under  a  variety  of  sample  sizes 
and  underlying  distributions.  The  results  of  this  experiment  are  pre¬ 
sented  in  Section  2.3.  In  Section  2.4  conclusions  are  drawn  based  on  the 
research  done  to  date  and  suggestions  are  made  for  further  work.  More 
detailed  discussions  of  the  material  in  this  section  are  given  in  Sections 
3  through  6. 

2.1  EXPONENTIAL  TAIL  AND  W'  3ULL  TAIL  ESTIMATORS 

Let  the  values  obtained  in  a  random  sample  of  size  n  from  the 
distribution  of  X  be  written  in  decreasing  order  as  X^>...>Xn. 

,  Suppose  first  that  X  has  an  exponential  distribution  with  a  mean  a. 

Then  the  tail  probabilities  are  given  by  S(x)  =  e  ,  x>0.  The  maximum 

A 

likelihood  estimator  of  the  parameter  a  is  given  by  a  =  (X-.+...+X  )/n. 

a  •  n 

(  This  leads  to  the  estimator  S(x)  =  e"X//a  of  S(x). 

Let  xQ  be  a  positive  number.  If  X  is  exponential  with  mean  a, 

then  the  conditional  distribution  of  X-xQ  given  that  X>Xq  is  exponential 

I  with  mean  a.  Suppose  the  latter  but  not  necessarily  the  former  property 

-(x-xQ)/a 

holds.  Then  S ( x )  =  S(xg)e  ,  x>Xg.  The  parameter  S(xq)  can  be 

A  1 

estimated  by  S(xQ)  =  n  #{i:X..>x0},  where  #  I  denotes  the  number  of 
I  elements  in  the  set  I.  Set  m  =  #{i :Xi>xQ}.  If  m> 1 ,  then  maximum 


» 
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likelihood  considerations  suggest  estimating  a  by  a  =  m"  ^  (X.-xn). 

-  1  u 
This  now  leads  to  the  estimator  S(x)  of  S(x)  given  by 


m 


^  A  -(x-xQ)/a 
S(x)  =  S(xQ)e  ,  x>x 


-  0 


In  practice  a  positive  integer  m  is  chosen  such  that  2<m<n  and  the 
preceeding  estimator  is  applied  with  xQ  =  X  ^  That  is. 


a'  -  ll  Cx,  -  VP 


m 


and 


•  *-Vi 


(2,1) 


(2.2) 


Associated  with  (2.2)  is  the  estimate  of  xp  given  by 


0  -  v  +  2  m+1  n.,  m+1 

x  -  X  +  a  log  — —  ,  0<p  <  - 

p  m+1  3  np  —  n 


(2.3) 


In  order  to  complete  the  definition  of  S(x)  let  the  empirical  estimator  be 
be  used  for  x<X  ^ ;  that  is,  set 


S(x)  =  ^  #{i:X.>x}  ,  0<x<X. 


m+1 


(2.4) 


Similarly,  to  complete  the  definition  of  xp  set 

m+1 


Xp  ■  *r(np)  •  IT  «  • 


cz.s: 


where  f(y)  denotes  the  smallest  integer  which  equals  or  exceeds  y  (a 
notation  borrowed  from  APL).  The  estimators  given  by  (2.2)  and  (2.3)  are 
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called  exponential  tail  estimators.  They  have  been  used  successfully  in 
a  number  of  applications,  starting  with  Breiman,  Stone  and  Gins  [3J.  Very 
similar  estimators  have  been  proposed  by  Hill  [7]  and  Weissman  [18]. 

Hopefully  when  (2.2)  and  (2.3)  are  applicable  they  should  yield 
more  accurate  estimates  than  the  empirical  estimators  given  by  (2.4)  and 

A 

(2.5).  Suppose  in  particular  that  p=l/n,  let  x^n  be  the  estimator  of 
the  (l/n)th  quantile  x^n  given  by  (2.3)  and  let  Mn  =  be  the  estimator 
of  x-]/n  given  by  (2.5).  (Note  that  Mn  is  the  maximum  value  in  the  random 
sample  of  size  n.)  It  is  reasonable  to  expect  that  for  some  choice  of  m 
as  a  function  of  n  with  nuoo  and  m/n-43  as  n-H» 


lim 

n-*» 


E(Si1/n  -  W 
E<Hn  -  ‘I// 


2 


=  0 


Theorem  6.3  contains  a  more  general  result  for  a  large  class  of  underlying 
distribution  functions  F. 

Suppose  now  that  Xb  has  an  exponential  distribution  with  mean  a  for 
some  b>0  (so  that  X  itself  has  a  Weibull  distribution)  and  let  xQ  be  a  posi¬ 
tive  number.  Then  the  conditional  distribution  of  Xb  -  Xg  given  that  X>Xg 
is  exponential  with  mean  a.  Thus 

h  h  h  h  -(x  ~xn''a 

P(X>X|X>Xg)  =  P(XD  -  Xg  >  X  “  Xg|X>Xg)  =  6  »  Xj>  X  g  > 


and  hence 

-(.xb-xb]/a 

S(x)  =  S(xg)e  ,  x>Xg  .  (2.6) 


This  suggests  estimators  of  S(x)  of  the  form  given  by 
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S(x)  =  S(xQ)e 


-(xb-Xg)/a 


x>x 


zro 


As  before,  replace  xQ  by  Xm+-j  and  set  S(Xg)  =  (m+l)/n.  This  leads  to  the 
estimator  of  S(x)  given  by 


*  ^  -(x^-xf+1)/a 

S(x)  =  — -  e  m  1  ,  x>X 


-  m+1 


(2.7) 


Associated  with  (2.7)  is  the  estimator  of  xp  given  by 


-  X 


♦a  log  SI  I  ,  0<p<^  .  (2.8) 


m+1 


np 


1/b 


m+1 


_  n 


The  estimators  given  by  (2.7)  and  (2.8)  are  called  Weibull  tail  estimators. 
They  are  supplemented  by  (2.4)  and  (2.5)  respectively.  (For  similar  estimator 
see  Weinstein  [16]  and  followup  papers  by  Jeruchim  [8]  and  Filimonov  [4].) 

Of  course  in  order  to  complete  the  definitions  of  Weibull  tail 

A  A 

estimators,  it  is  necessary  to  specify  a  and  b.  Given  b,  maximum  likeli¬ 
hood  considerations  suggest  estimating  a  by 


A 

a 


There  are  a  variety  of  reasonable  ways  to  choose  b.  One  approach  is 
maximum  likelihood  based  on  an  assumed  Weibull  tail  form  for  the  under¬ 
lying  distribution  F.  But  this  approach  leads  to  numerical  solution  of 
an  equation  with  no  particularly  nice  properties  (such  as  always  having 
a  unique  solution).  Also,  the  maximum  likelihood  approach  is  not 
theoretically  compelling  unless  the  assumed  theoretical  form  is  really 
believed  to  be  valid. 
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Another  approach  is  to  make  some  feature  of  the  empirical 

/V  A 

h  * 

distribution  of  (X?  -  X^^/a,  l£i<m,  match  up  with  the  corresponding 
feature  of  the  standard  exponential  distribution.  The  test  for  exponential- 
ity  due  to  Shapiro  and  Wilk  [14]  (see  also  Stephens  [15])  suggests  making 

A  A 

L  L.  A. 

the  empirical  second  moment  of  (X°  -  x“+1)/a,  l<i<m,  equal  to  the  second 
moment  of  the  standard  exponential  distribution,  namely  2.  According  to 

A 

this  criterion,  b  should  be  chosen  to  satisfy  the  equation. 

U(b)  =  2  ,  (2.9) 

where 


It  is  shown  in  Section  4.2  that  U  is  a  continuous  strictly  increaasing 
function  on  (0,°°)  which  has  limits  at  0  and  °°  given  by  U(°°)  =  m>2  and 


U(0)  = 


Thus  (2.9)  has  at  most  one  solution.  It  does  have  a  solution  if  and  only 
if 
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m  1 


(4,09r 


'm+ly 


(2.10) 


Since  U  is  continuous  and  strictly  increasing--indeed,  it  has  a  strictly 
positive  derivative--on  (0,°°)  it  is  easy  to  solve  (2.9)  numerically  when 
(2.10)  holds.  When  (2.10)  fails  to  hold  the  estimators 


and 


/V 


0<p<S(xQ),  (2.12) 


with 


=  mS  lQ5 


X. 

Vi 


are  used  instead  of  the  Wei  bull  tail  estimators.  The  estimators  given  by 
(2.11)  and  (2.12)  are  called  Pareto  tai 1  estimators  (see  Hill  [7]).  They  arise 

A 

as  limits  of  Wei  bull  tail  estimators  by  letting  b-*0. 

A  general  form  of  transformed  exponential  tail  estimators  is  discussed 
in  Section  4.1.  This  form  includes  exponential  tail,  Pareto  tail  and  Weibull 
tail  estimators  as  special  cases.  Other  special  cases  are  mentioned  in 
Section  4.3.  One  such  special  case,  which  provides  a  generalization  of  the 
Pareto  tail  estimators,  is  always  applicable  when  (2.10)  fails  to  hold. 

(For  another  fairly  general  approach  to  the  estimation  of  tail  probabilities, 
see  Pickands  [12]. ) 
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In  a  modified  setting  in  Section  5  an  optimality  property  is  shown 
to  hold  for  Wei  bull  tail  estimators  when  the  underlying  distribution  F  is  a 
tail  Weibull  distribution;  i.e.,  when  (2.6)  holds  for  some  Xg  and  some 
choice  of  the  positive  parameters  a  and  b. 

2.2  OTHER  ESTIMATORS 

Given  1<N<100,  exponential  tail  and  Weibull  tail  estimators  with 
m  =  T ( Nn/1 00 )  are  referred  to  as  exponential  tail  N%  and  Weibull  tail  N% 
estimators.  In  the  Monte  Carlo  experiment  these  estimators  were  compared 
with  various  other  estimators,  which  will  now  be  described. 

The  empirical  estimators  of  S(x)  and  xp  are  defined  by 

S(x)  =  ^  #(i.:X.j>x}  ,  x>0,  (2.13) 

and 

Vxr(nP)  •  0<^-  <2-14> 

If  has  an  exponential  distribution  with  mean  a  (so  that  X  itself 

b . 

has  a  Weibull  distribution), then  S(x)  =  e~x  /a  .  This  suggests  estimators 

/v 

S ( x )  of  the  form 

*  b.: 

S ( x )  =  e'x  /a  ,  x>0  ,  (2.15) 

A  A 

where  the  positive  numbers  a  and  b  are  chosen  from  the  sample  data.  The 
corresponding  estimator  of  xp  is  given  by 


12 


a  /\  i  1  /  b 

x  =  (a  log  ,  0<p<l  .  (2.16) 

r  r 


The  parameters  a  and  b  can  be  chosen  by  the  maximum  likelihood 
method.  According  to  this  method 


/v  i  n  I 

,  (2.17) 


while  b  satisfies  the  equation 


log  Xi 


The  right  side  of  (2.18)  is  continuous  and  strictly  increasing  in  b. 
(indeed, the  derivative  is  strictly  positive)  on  (0,®)  and  has  limit 

.a.  I  n  a 

as  b  +  0  and  limit  log  X,>n~  £  log  X.  as  b->-°°.  Thus  (2.18)  has  a  unique 

1  1 

solution,  which  is  easily  found  numerically.  The  estimators  given  by  (2.15) 

A  /V 

and  (2.16)  with  a  and  b  given  by  (2.17)  and  (2.18)  are  called  Weibul 1 
maximum  likelihood  estimators. 

/\  A, 

For  an  alternative  method  of  choosing  a  and  b,  pick  m  and  m1  with 

A  A  A  A 

l<m'<m<n.  Choose  a  and  b  such  that  S ( Xm , )  =  m'/n  and  S(Xm)  =  m/n.  These 
two  equations  can  be  solved  explicitly,  yielding  the  formulas 


-  A  -  TT  I  log  X.  .  (2.18) 

b  n  1  1 
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a  = 


log  - 
s  m 


(2.19) 


and 


b  = 


loq  1  oa  ~r  —  log  log 

log  X  ,  -  log  Xm 
3  m  3  m 


(2.20) 


The  estimators  given  by  (2.15)  and  (2.16)  with  a  and  b  given  by  (2.19)  and 
(2.20)  are  called  Weibull  percentile  estimators.  (Precisely,  (2.15)  and 
(2.16)  are  applied  when  x>_X  and  0<p  <m/n  respectively;  otherwise,  the 
empirical  estimators  given  by  (2.13)  and  (2.14)  are  used.)  For  1<N<100  the 
Weibull  percentile  estimators  with  m  =  r(Nn/100)  and  m'=  ("(Nn/1000)  are 
referred  to  as  Weibull  N%  estimators. 

2 

If  log  X  has  a  normal  distribution  with  mean  y  and  variance  a  (so 
that  X  itself  has  a  lognormal  distribution),  then  S(x)  =  Q(o~^(log  x-y)),  where 


oo  2 

Q(x)  =  ~  f  e-y  ^dy  . 

2tt 

This  suggest  estimators  of  S(x)  of  the  form 

A 

S(x)  =  }  x>0  #  (2.21) 

/V  /v 

where  y  and  a>0  are  chosen  from  the  sample  data.  The  corresponding  estimator 
of  x  is  given  by 

p  3 
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x  =  eu+a 


0<p<l . 


(2.22) 


A  A 

The  numbers  y  and  a  can  be  chosen  by  the  maximum  likelihood  method, 
which  yields 


;  =  ^2>g  X.  (2.23) 

and 

o2=  (log  Xri)2  .  (2.24) 

n  1  i 

/v  /V 

The  estimators  given  by  (2.21)  and  (2.22)  with  y  and  a  given  by  (2.23) 
and  (2.24)  are  called  lognormal  maximum  likelihood  estimators. 

A  A 

For  an  alternative  method  of  choosing  y  and  a  ,  pick  m  and  m'  with 

A  A  A  /V 

l<m'<m<n.  Choose  y  and  a  such  that  S(Xm,)  =  m'/n  and  S(Xm)  =  m/n.  These 

two  equations  can  be  solved  explicitly,  which  yields  the  formulas 


y  =  log  X  -a  Q”^  (— ) 
3  m  v  vn' 


(2.25) 


and 


a 


lQg  Xm'-1og  Xm 


(2.26) 


The  estimators  given  by  (2.21)  and  (2.22)  with  y  and  a  given  by  (2.25)  and 
(2.26)  are  called  lognormal  percentile  estimators.  (Precisely,  (2.21) 
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and  (2.22)  are  applied  when  x>X^  and  0<p<m/n  respectively;  otherwise  the 
empirical  estimators  given  by  (2.13)  and  (2.14)  are  used.)  For  1<N<100 
lognormal  percentile  estimators  with  m=  T ( Nn/ 1 00 )  and  m'=  T ( Nn/ 1 000 )  are 
referred  to  as  lognormal  N%  estimators. 

A  A  A 

Let  a  pair  of  estimators  S(x)  and  xp  be  written  as  S(x;  )f^...,Xn) 

and  X  (X^,...,Xn)  to  denote  explicitly  the  dependence  on  the  sample  data. 

The  pair  is  said  to  be  scale  invariant  If  for  all  choices  of  x>0,  0<p<l , 

c>0  and  X,>. . .>X  >0 
I  n 

A  A 

S(cx,  cX1,...cXn)  =  S(x,X1 ,. . .  ,Xn) 
and 


Xp  ( cXi , . . . ,  cXn )  s  cx  (Xr...,Xn). 


The  pair  is  said  to  be  power  invariant  if  for  all  choices  of  x>0,  0<p< 1 , 

c>0  and  X,>. . ,>X  >0 
1  n 


and 


S(xC,X^,...,x£)  =  S(x,Xr...,Xn) 
Xp(X] » •  •  •  ’Xp)  =  (xp(X-| » —  *xn ) ) 


All  of  the  above  estimators  are  scale  invariant,  and  all  except  exponential 
tail  estimators  are  power  invariant. 
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2.3  MONTE  CARLO  COMPARISONS 

A  Monte  Carlo  experiment  was  performed  to  determine  the  accuracy 
of  exponential  tail  and  Weibull  tail  estimators  of  tail  probabilities  and 
quantiles  and  to  compare  these  estimators  with  the  other  estimators 
described  in  Section  2.2.  Specifically  the  following  twelve  estimators 
were  evaluated: 

empirical ; 

exponential  tail  5%,  10%,  25%; 

Weibull  tail  25%,  50%; 

Weibull  maximum  likelihood; 

Weibull  percentile  10%,  25%; 
lognormal  maximum  likelihood; 
lognormal  percentile  10%,  25%. 

Twenty  underlying  distribution  functions  were  considered.  They  will 

be  defined  in  terms  of  four  groups  each  having  five  distribution  functions, 

the  distribution  functions  in  the  it,n  group  being  denoted  by  F..,  l<j<5. 

■  J 

The  distributions  functions  in  each  group  will  depend  on  five  values  b., 

3 

l£j£5,  of  a  parameter  b  given  explicitly  by  b^=.5,  b2=.75,  b3=l ,  b4=l . 5 
and  b5=2. 

The  first  group  consists  of  Weibull  distribution  functions,  defined 

by 

F1j(x)  =  1  -  e"x  ,  x>0  . 

In  particular  F^3  is  the  standard  exponential  distirbution  function,  given 
by  P-j 3 ( x )  =  l-e'x,  x>0.  Let  X  be  a  random  variable  having  distribution 
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1/bi 

function  F^.  Then  X  J  has  distribution  function  F^.  for  l£j£5.  Let 

X,>...>X  be  the  ordered  sample  values  based  on  a  random  sample  of  size  n 
1  n  1/b.  1/b 

from  the  standard  exponential  distribution  function  F^.  Then  X-j  J>...>Xn  J 

can  be  thought  of  as  the  ordered  sample  values  based  on  a  random  sample  of 

size  n  from  F^ .  This  observation,  together  with  the  power  invariance  of 

all  but  exponential  tail  estimators,  leads  to  substantial  computational 

savings  in  the  Monte  Carlo  experiment.  The  same  observation  and  savings  are 

applicable  to  each  of  the  other  three  groups  of  five  distribution  functions 

defined  below. 

The  second  group  consists  of  mixture  of  the  corresponding  Weibull 

distributions  in  the  first  group.  Specifically  let  X  now  be  a  random 

variable  which  with  probability  1/2  is  chosen  from  an  exponential  distribution 

with  mean  1/3  and  with  probability  1/2  is  chosen  from  an  exponential 

distribution  with  mean  5/3.  Then  X  has  mean  1  as  before.  For  l<j<5,  let 

1/b. 

F„ .  be  the  distribution  function  of  X  then 


F2j (x)  =  {^^(3^  x)  +  Fi;j((|)bJ  x)],  x>0  . 

The  third  group  consists  of  lognormal  distributions.  Specifically 

let  X  now  be  a  lognormal  random  variable  having  unit  mean  and  variance. 

1/b. 

For  l<j<5  let  F  .  be  the  distribution  function  of  X  J.  If  X.  has 

distribution  function  F_ . ,  then  log  X.  is  normally  distributed  with  mean 

J 

2 

u-  and  variance  a.  given  by 
J  J 


log  2 
"2bj 


and 


_2  loa  2 

VIP 

J 


•U. 
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also 


F3j(x) 


x>0  . 


The  fourth  group  consists  of  mixtures  of  the  corresponding  lognormal 
distributions  in  the  third  group.  Specifically  for  l<j<5. 


F4  j  ( x )  =  2*-F33 


b. 

(3  Jx) 


*  F33<l>bjxl 


x>0 


In  particular 


F43(x)  =  ^F33(3x)  +  F33^  5  ^ ’  x>0  • 

1/b. 

If  X  has  distribution  function  ,  then  X  J  has  distribution  function  f4J- 
Wei  bull  and  lognormal  distributions  were  chosen  because  they  are 
commonly  employed  in  reliability,  environmental  and  other  applications  where 
extreme  values  play  an  important  role.  The  indicated  mixtures  were  included 
to  give  a  larger  diversity  of  shapes.  They  were  motivated  in  part  by  the 
useful  role  that  normal  mixtures  have  played  in  robustness  studies  of 
estimators  of  a  location  parameter.  The  selected  values  .5,  .75,  1,  1.5  and 
2  of  the  parameter  b  were  chosen  based  on  experience  in  fitting  Weibull 
distributions  to  environmental  data. 

It  is  useful  to  define  a  measure  of  the  heaviness  of  the  distribution 
function  of  the  random  variable  X  in  order  to  classify  the  distribution  function 
as  heavy-tailed,  medium-tailed  or  light-tailed.  To  this  end,  suppose  that  the 
function  £  defined  by  £(x)=  -log  P(X>x)=  -log  S(x)  is  twice  differentiable  and 
has  positive  first  derivative  on  (0,«>).  It  is  convenient  to  require  that  the 


measure  of  heaviness  be  local  and,  specifically,  that  it  depend  only  on  the 


first  two  derivatives  of  l  evaluated  at  a  fixed  quantile  Xp,  where  0<p<l ;  let 

the  measure  be  denoted  by  H^(p).  It  is  natural  to  require  that  Hx(p)  be 

scale  invariant;  i.e.,  that  hcX(p)  =  Hx(p)  for  c>0.  It  is  also  natural  to 

require  that  if  X  is  exponentially  distributed,  then  Hx(p)=0,  H  y[3(p)>0  for 

I  /h  ^ 

0<b<l,  and  H  •!/{-,( P )<0  for  b>l  (note  that  X  '  has  a  Weibull  distribution  with 
X 

shape  parameter  b). 

Define  r  on  (0,°°)  by 


r(x)  - 

(i'(x)r 


x>0 , 


and  set 


HX(P)  =  -r(xp). 

Then  Hx(p)  satisfies  all  the  criteria  mentioned  in  the  previous  paragraph. 
It  is  easily  seen  that 


Hxi/b(p)  =  syny  *  Hx(p)  for  b>0- 


(2-27) 


In  particular,  if  X  is  exponentially  distributed,  then 


Hxl/b(p>  =  b  logO/p)  fpr  6>0- 

Observe  here  that  H  ^(p)  converges  to  zero  slowly  as  p-*».  If  log  X  is 
X  2 

normally  distributed  with  variance  a  ,  so  that  X  itself  is  lognormal ly 
distributed,  the  convergence  of  Hx(p)  to  zero  as  p-*0  is  even  slower; 
specifically,  by  straightforward  calculations, 
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I 


Hy(p)  =  (cr+z  )  Tr|-r  -  1  ~  - q  ~Yi,  as  p^O, 

X  P  *(V  (2  log  l)*5 

where  4> * i s  the  density  of  the  standard  normal  distribution  function  4>  and 
1  -  *(zp)  =  p.  » 

The  values  of  H^(p)  corresponding  to  the  twenty  underlying  distributions 
used  in  the  Monte  Carlo  experiment  are  shown  in  Table  1.  The  different  rows 
correspond  to  the  different  groups,  indicated  under  "F"  by  the  following  « 

abbreviations:  WI  (Weibull),  WM  (Weibull  mixture),  LN  (lognormal  mixture), 
and  LN  (lognormal).  The  different  columns  correspond  to  the  indicated  values 
for  the  parameter  b.  (By  (2.27),  within  each  group  the  heaviness  is  a  linear 
function  of  b"1  with  positive  slope.) 

According  to  Table  1  among  the  twenty  underlying  distribution  functions 
there  is  a  preponderance  of  heavy-tailed  distribution  functions.  The  th^ee 
heaviest-tailed  distribution  functions  F^-p  and  F^  have  much  heavier 
tails  than  those  which  typically  arise  in  the  environmental  applications 
in  which  exponential  tail  estimators  were  originally  developed.  The  relative 
performance  of  exponential  tail  estimators  in  the  numerical  comparisons  which 
follow  would  be  significantly  improved  if  results  for  these  three  distribu¬ 
tion  functions  were  ignored  in  making  the  comparisons. 

Table  1.  Heaviness  of  distribution  functions  at  x  -j 


F 

.50 

.75 

1.00 

1.50 

2.00 

WI 

.43 

.14 

.00 

-.14 

-.22 

WM 

.64 

.23 

.03 

-.18 

-.28 

LM 

.80 

.41 

.22 

.02 

-.07 

LN 

.68 

.36 

.20 

.05 

-.03 

In  the  Monte  Carlo  experiment  the  sample  size  n  took  on  the  values 
100,  200,  400  and  800.  The  quantiles  xp  were  estimated  for  p  =  1/50,  1/100, 
1/200,  1/400,  1/800  and  1/1600.  Tail  probabilities  were  estimated 
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as  x  ranged  over  the  quantiles  xp  (p=l/50, ... ,1/600)  of  whatever  under¬ 
lying  distribution  was  being  simulated  at  the  time. 

Two  different  measures  were  used  initially  to  evaluate  estimators 
of  tail  probabilities,  namely 

R?j(S(xp))  »  Efj(S(xp)  -  S(xp))2 
and 

(S(Xp) )  =  E^. (log  S(xp)  -  log  S(xp))2  . 

Here  E..  refers  to  expectation  when  F..  is  the  underlying  distribution 

'  J  1  J 

4L 

function  and  x  is  the  pcn  quantile  of  F...  Note  that  the  second  measure 
P  ■  J 

2 

L.  ■  is  inapplicable  to  the  empirical  estimator  of  S(x  )  since  the  estimate 

*  J  r 

has  positive  probability  of  being  zero.  The  measure 


Rii{xp}  = 

1J  P  E .  . (X 
ij 


i/Vp1' 


r(pn)"  V 


2  ~ 

was  used  initially  to  evaluate  estimators  of  quantiles.  Note  that  R..(x  ) 

i  j  p 

A 

is  the  ratio  of  the  mean  square  error  of  the  estimator  xp  to  the  mean  square 
error  of  the  empirical  estimator  x|-(pn)  of  xp.  The  use  of  this  measure  was 
motivated  in  part  by  the  asymptotic  results  for  exponential  tail  estimators 
obtained  in  Section  6. 

Monte  Carl  estimates  of  the  above  measures  were  obtained  for  each 
estimator  and  each  quantity  to  be  estimated  by  averaging  over  a  number  of 
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replications.  Specifically  for  sample  sizes  100,  200,  400  and  800  there 
were  respectively  600,  400,  300  and  200  replications.  (The  relative  standard 

/V  A  /\ 

errors  (see  Section  3.3)  of  R..(S(x  )),  L..(S(x  ))  and  R..(x  )  were  typically 

lJP'JP  P 

about  .05  or  less.  This  is  small  enough  to  warrant  the  qualitative  conclusions  ! 

which  follow. ) 

Summary  statistics  were  then  obtained  by  averaging  these  measures 
over  the  twenty  underlying  distribution  functions  and  then,  for  ease  in  / 

presentation,  taking  square  roots.  The  results  are  given  in  Section  3. 

Next,  in  light  of  the  summary  statistics  in  Section  3.3,  it  was 
decided  to  look  at  five  estimators  for  each  sample  size:  empirical;  exponen¬ 
tial  tail  N%  for  the  best  overall  value  of  N  (for  the  given  sample  size  n 
and  p=l/n);  Wei  bull  tail  50%;  Wei  bull  percentile  N%  for  the  best  overall 
value  of  N;  and  lognormal  percentile  N%  for  the  best  overall  value  of  N. 

The  specific  values  of  N%  are  as  follows: 


Estimator 

n 

100 

200 

400 

800 

Exponential  tail 

25% 

10% 

10% 

5% 

Wei  bull  percentile 

25% 

25% 

10% 

10% 

lognormal  percentile 

25% 

25% 

10% 

10% 

each  sample  size  n  and  for 

p=l/n  the 

quantities 

min  R--(S(x  )), 

^  'Jr 

SV 

min  L..(S(x  )), 

/v  'Jr 

s(xp) 


rr 


r 
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and 


min  R..(x  ) 
-  U  P 


were  evaluated.  Here  the  first  and  third  minimizations  were  over  all  five 
estimators  selected  for  the  given  sample  size;  while  the  second  minimization 
involved  all  of  these  estimators  except  the  empirical  estimator,  for  which 

A 

Li j (S(x  ) )  isn't  applicable.  Corresponding  "efficiencies"  were  defined  by 


~  min  R..(S(x  )) 

eu(s(V>  -  Sc,)  J  P 


RfJ<s(Xp» 


min  L..(S(x  )) 

eij(S(xp))  -  S(xp) 


and 


Lfj(s(xp,) 


mi"  HfJC*p) 


e. .(x  ) 
ij  P 


R. . (x  ) 
ij  P 


(Note  that  the  usual  definition  of  efficiency  is  the  square  of  the  definition 

L  A 

used  here.  Note  also  that  the  measure  e..(S(x  ))  is  not  applicable  to  the 

ij  p 

empirical  estimator.)  These  efficiencies  were  then  averaged  over  the  twenty 


mqr»v.:A — p 


-r’?:  t  -- x*  -  •— r— — 
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underlying  distribution  functions,  which  yielded  measures  denoted  by 

_  a  ~ 

e(S(x  )),  e  (S(x  ))  and  e(x  ).  The  results  are  shown  in  Table  2.  Here 

r  H  M 

the  abbreviations  EM  (empirical),  LP  (lognormal  percentile),  WP  ( Wei  bull 
percentile),  WT  ( Wei  bull  tail)  and  ET  (exponential  tail)  are  used  for  the 
five  estimators. 

A 

Relative  to  the  average  efficiency  e(x  )  of  the  estimators  of  the 
quantile  xp  when  p=l/n,  the  exponential  tail  estimator  does  essentially 
as  well  as  or  better  than  any  of  the  other  estimators  for  each  of  the  four 
sample  sizes  considered.  This  rather  surprising  result  lends  much  credence 


to  exponential  tail  estimators  of  x 


1/n* 


The  Wei  bull  tail  estimator  comes 


*  out  a  close  second.  It  should  be  noted  that  the  exponential  tail  estimator 


does  substantially  better  than  the  empirical  estimator,,  but  that  this  improve¬ 
ment  is  rather  insensitive  to  the  sample  size  n.  The  theoretical  results  in 
Section  6  suggest  increasing  improvement  with  increasing  sample  size.) 

A 

Relative  to  the  average  efficiency  e(S(xp))  of  estimators  of  the 
tail  probability  S(x)  when  this  probability  actually  equals  1/n,  the  Weibull 
tail  estimator  overall  does  the  best,  while  the  Weibull  percentile  estimator 
comes  out  a  close  second  and  the  estimator  tail  estimator  is  third  best  but 


still  performs  reasonably  well.  Surprisingly  different  results  are  obtained 
when  the  average  efficiency  eL(S(xp))  is  used  instead  of  e(S(xp)).  Now  the 
lognormal  percentile  estimator  does  best  for  each  of  the  four  sample  sizes; 
the  Weibull  tail  estimator  is  a  close  second  for  the  two  larger  sample  sizes, 
but  there  is  no  good  competitor  to  the  lognormal  percentile  estimator  for 
the  two  smaller  sample  sizes. 


Table  2.  Average 

efficiencies  for 

p  =  l/n 

n 

Measure 

ET 

WT 

WP 

LP 

EM 

100 

A 

e(S (x  ) ) 

— L  ~  H 

.88 

.97 

.86 

.66 

.71 

e  (S(xp)) 

.74 

.53 

.61 

.94 

e(xp) 

.97 

.86 

.81 

.49 

.41 

200 

¥(S(x  )) 

.84 

.98 

.92 

.60 

.68 

L  D 

e  (S(x  )) 

.74 

.76 

.69 

.95 

-  P 

e(xp) 

.96 

.92 

.89 

.51 

.42 

400 

e(S (xp)  ) 

.87 

.92 

.95 

.57 

.63 

^(SCx  )) 

.74 

.85 

.60 

•  88 

—  /\  P 
e(x  ) 

.94 

.92 

.83 

.49 

.41 

P 

800 

e(S(x  )) 

.84 

.91 

.91 

.57 

.59 

T  5 

eL(S(x  )) 

_  /A  * 

.72 

.86 

.66 

.89 

e(x  ) 

.90 

.91 

.80 

.54 

.35 

p 
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It  was  next  decided  to  determine  how  the  efficiency  depends  on 
the  choice  of  the  underlying  distribution  function.  Except  for  exponential 
tail  estimators  the  various  efficiencies  e^  are  relatively  insensitive  to 
j  and  can  be  approximated  satisfactorily  by  (e^  +  ...  +  e..g)/5.  This 

A 

should  be  expected, since  for  the  other  estimators  R..(S(x  ))  and  (when 

I J  K 

applicable)  L1-j (S(Xp) )  are  independent  of  j  and 


R. .  (x  ) 
ij  P 


/~l/b.  1/b. 

Ei 3  \xp  ^  -  *p  3 


1/b. 

E i 3 \X  T (pn) 


1/b, 


For  exponential  tail  estimators,  however,  e,.  varies  considerably  with  j. 

*  J 

Thus  we  decided  to  calculate  (e^  +.,.+  ei5)/5  for  each  of  the  (four  or) 
five  estimators  and  in  addition  to  calculate  e..,  l£j£5,  for  exponential 

'  J 

tail  estimators.  The  results  are  shown  in  Table  3.  The  efficiencies 
e | j ,  l<j<5,  for  the  exponential  tail  estimators  are  shown  in  the  first 
five  columns,  the  headings  denoting  the  corresponding  values  of  b..  The 

J 

entries  under  the  column  labeled  "F"  describe  the  distribution  functions 
t  h 

in  the  i  group:  WI  (Weibull),  WM  (Weibull  mixture),  LM  (lognormal 
mixture)  and  LN  (lognormal). 

Table  3  sheds  light  mainly  on  the  behavior  of  exponential  tail 
estimators.  It  is  worthwhile  to  look  at  the  first  five  columns  of  Table  3 
in  conjunction  with  Table  1.  As  expected,  exponential  tail  estimators 
perform  very  well  when  the  heaviness  of  the  underlying  distribution 
function  is  near  zero. 


I 
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Table  3-  Individual  efficiencies  for  p-l/n 


n 

Measure 

F 

.50 

.75 

1.00 

1.50 

2.00 

ET 

WT 

WP 

LP 

EM 

100 

e(S(xp)) 

WI 

.91 

1.00 

.99 

.85 

.76 

.90 

.99 

.69 

.65 

.72 

WM 

.83 

.98 

.98 

.80 

.68 

.86 

1.00 

.85 

.60 

.68 

LM 

.56 

.84 

.97 

1.00 

.95 

.86 

.95 

1.00 

.69 

.72 

LN 

.58 

.89 

1.00 

1.00 

1.00 

.89 

.96 

.88 

.71 

.74 

eL(S(xp)) 

WI 

.40 

.75 

1.00 

1.00 

1.00 

.83 

.48 

.55 

.91 

WM 

.26 

.55 

.85 

1.00 

1.00 

.73 

.64 

.73 

.96 

LM 

.22 

•  43 

.66 

1.00 

1.00 

.66 

.48 

.61 

.96 

LN 

.29 

.54 

.77 

1.00 

1.00 

.72 

.53 

.54 

.93 

e(Xp) 

WI 

1.00 

1.00 

1.00 

.91 

.77 

.94 

.92 

.69 

.51 

.49 

WM 

1.00 

1.00 

1.00 

.92 

’  .72 

.93 

.89 

.75 

.40 

.51 

LM 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

.80 

.91 

.48 

.34 

LN 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

.81 

.89 

.58 

.32 

200 

e(S (Xp) ) 

WI 

.84 

.91 

.  91 

.87 

.84 

.87 

1.00 

.70 

.57 

.67 

WM 

.84 

.94 

.93 

.85 

.79 

.87 

.97 

1.00 

.48 

.65 

LM 

.49 

.74 

.85 

.90 

.90 

.78 

.93 

1.00 

.63 

..66 

LN 

.56 

.80 

.90 

.96 

.96 

.84 

1.00 

.97 

.70 

.72 

eL  (S  (xp  )  ) 

WI 

.51 

.77 

.94 

1.00 

1.00 

.84 

.75 

.73 

.96 

WM 

.39 

.66 

.85 

1.00 

1.00 

.78 

.98 

.90 

.86 

LM 

.29 

.49 

.67 

.91 

1.00 

.67 

.66 

.57 

.99 

LN 

.31 

.51 

.67 

.88 

1.00 

.68 

.65 

.57 

1.00 

e(Xp) 

WI 

1.00 

1.00 

1.00 

.89 

.82 

.94 

.97 

.76 

.49 

.52 

WM 

1.00 

1.00 

1.00 

.94 

.82 

.95 

.89 

.88 

.35 

.49 

LM 

.85 

1.00 

1.00 

1.00 

1.00 

.97 

.92 

.95 

.54 

.32 

LN 

.87 

1.00 

1.00 

1.00 

1.00 

.97 

.90 

.98 

.66 

.36 
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Table  3.  (Cont.)  Individual  efficiencies  for  p=l/n 


n 

Measure 

F 

.50 

.75 

1.00 

1.50 

2.00 

ET 

WT 

WP 

LP 

EM 

400 

e(S (Xp) ) 

WI 

.91 

1.00 

.94 

.  77 

.68 

.86 

1.00 

.85 

.59 

.66 

WM 

.91 

1.00 

1.00 

.  77 

.63 

.86 

.83 

.97 

.54 

.65 

LM 

.59 

.82 

.95 

1.00 

.96 

.86 

.93 

1.00 

.53 

.61 

LN 

.70 

.86 

.99 

1.00 

1.00 

.91 

.94 

.97 

.61 

.61 

L  ~ 

e  (S(x  )) 

WI 

.49 

.89 

1.00 

1.00 

1.00 

.88 

.85 

.55 

.88 

WM 

.30 

.63 

.92 

.97 

.87 

.74 

1.00 

.74 

.77 

LM 

.22 

.42 

.64 

1.00 

1.00 

.65 

.79 

.53 

.95 

LN 

.26 

.48 

.71 

1.00 

1.00 

.69 

.78 

.57 

.93 

e(ip) 

WI 

1.00 

1.00 

1.00 

.82 

.69 

.90 

.97 

.77 

.51 

.44 

WM 

1.00 

1.00 

1.00" 

.86 

.67 

.91 

.83 

.92 

.45 

.52 

LM 

.96 

.99 

1.00 

1.00 

1.00 

.99 

.94 

.76 

.44 

.32 

LN 

.87 

.94 

1.00 

1.00 

1.00 

.96 

.93 

.85 

.58 

.34 

800 

e(S(x  )) 

WI 

.85 

.98 

.93 

.  79 

.  70 

.85 

1.00 

.84 

.52 

.58 

WM 

.88 

.98 

.90 

.73 

.64 

.83 

.64 

1.00 

.43 

.57 

LM 

.60 

.76 

.87 

.91 

.86 

.80 

1.00 

.80 

.53 

.53 

LN 

.60 

.83 

•  95 

1.00 

.99 

.88 

1.00 

.99 

.81 

.67 

eL(S(x  )) 

P 

WI 

.44 

.  78 

1.00 

1.00 

1.00 

.84 

.97 

.69 

.85 

WM 

.38 

.70 

.92 

1.00 

.96 

.79 

.96 

1.00 

.74 

LM 

.22 

.40 

.60 

.92 

1.00 

.63 

.84 

.47 

.98 

LN 

.23 

.40 

.57 

.83 

.99 

.60 

.68 

.48 

1.00 

e(x  ) 

D 

WI 

.88 

1.00 

.98 

.80 

.69 

.87 

1.00 

.78 

.48 

.37 

r 

WM 

1.00 

1.00 

1.00 

.80 

.67 

.89 

.68 

.97 

.36 

.40 

LM 

.73 

.81 

.91 

1.00 

.96 

.88 

1.00 

.63 

.53 

.31 

LN 

.88 

.95 

1.00 

1.00 

1.00 

.97 

.96 

.82 

.79 

.32 
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It  is  remarkable  how  slowly  the  behavior  of  the  exponential  tail 
estimator  of  the  quantile  x^n  deteriorates  as  the  heaviness  parameter 
becomes  positive.  The  behavior  of  the  exponential  tail  estimator  of  the 

A  A 

tail  probability  S(x.^n)  relative  to  the  criterion  L1-J.(S(Xyn)  deteriorates 
much  more  rapidly  as  the  heaviness  becomes  positive;  this  is  because  the 
indicated  criterion  severely  penalizes  substantial  underestimates  of  tail 
probabilities.  Observe  that  much  different  behavior  occurs  if  the  criterion 

Ri'(S(xl/n)’  which  severely  penalizes  substantial  overestimates  of  tail 

/\ 

probabilities,  is  used  instead  of  L^.(S(x^n). 

The  above  analysis  of  the  first  five  columns  of  Table  3  suggests  a 
new  procedure  which' hopefully  combines  the  best  features  of  exponential 

/v 

tail  and  Weibull  tail  estimators.  Namely,  first  choose  the  parameter  b 
according  to  the  Weibull  tail  50%  estimator,  and  then  apply  the  exponential 

A  A 

tail  N%  estimator  to  the  transformed  sample  data  X^,...,X^  .  This  two-step 
procedure  is  power  invariant  as  well  as  scale  invariant,  so  its  efficiency 
is  insensitive  to  the  parameter  b.  Thus  in  trying  to  "guestimate"  its 
efficiency  it  can  be  assumed  that  b=l. 

Suppose  the  underlying  distribution  is  Weibull  or  a  Weibull  mixture 

A 

with  b=l.  It  should  then  turn  out  that  b  is  approximately  equal  to  one,  in 
which  case  the  two-step  procedure  should  behave  similarly  to  the  exponential 

tail  N%  procedure.  But  inspection  of  Table  3  shows  that  this  estimator  does 

uniformly  well  for  all  four  sample  sizes  and  all  three  criteria  when  b=l 

and  the  distribution  is  Weibull  or  a  Weibull  mixture.  Suppose  instead  that 

the  underlying  distribution  is  lognormal  or  a  lognormal  mixture.  It  should 
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then  turn  out  that  b  is  approximately  equal  to  .5.  Thus  the  two-step 
procedure  should  behave  similarly  to  the  exponential  tail  N%  procedure 
when  b=2.  Again,  inspection  of  Table  3  shows  that  this  estimator  does 
uniformly  well  when  b=2  and  the  distribution  is  lognormal  or  a  lognormal 
mixture. 

There  is  room  for  further  improvement  when  using  the  two-step 
procedure.  Recall  that  in  Table  2  and  Table  3  for  the  exponential  tail 
N %  estimator,  N%  takes  on  the  values  25%,  10%,  10%  and  5%  respectively 
as  n  ranges  over  the  sample  sizes  100,  200,  400  and  800.  If  larger  values 
of  N%  were  chosen  the  decrease  in  variance  would  generally  not  compensate 
for  the  increase  in  bias.  The  two-step  procedure,  however,  should  be  much 
less  biased  than  the  exponential  tail  procedure,  so  larger  values  of  N% 
may  yield  still  further  improvement  for  the  two-step  procedure. 

The  two-step  procedure  looks  very  promising  at  this  point  in  time 
for  the  reasons  just  described.  But  further  speculation  along  these  lines 
is  unnecessary,  since  it  will  be  included  in  the  next  Monte  Carlo  experiment, 
which  will  be  started  soon. 

2.4  CONCLUSIONS  AND  DIRECTIONS  FOR  FURTHER  RESEARCH 

This  report  has  focused  on  two  procedures,  exponential  tail  and 
Wei  bull  tail,  for  estimating  tail  probabilities  S(x)  and  quantiles  xp  based 
on  a  random  sample  of  size  n  form  an  underlying  distribution.  Interest 
has  centered  around  estimating  S(x)  when  S(x)  =  1/n  and  estimating  xp  when 
P  -  1/n. 


The  theoretical  results  that  were  obtained  in  Sections  4-6, 
mostly  of  an  asymptotic  nature,  suggest  that  both  procedures  should  be 
reasonably  effective  in  estimating  the  quantities  of  interest.  More 
precise  numerical  results  were  obtained  by  means  of  a  Monte  Carlo  experi¬ 
ment.  These  results,  summarized  in  Section  2.3,  confirm  that  both  proce¬ 
dures  are  promising,  but  strongly  suggest  that  substantial  further  improve¬ 
ments  could  be  made  by  properly  combining  them. 

In  the  next  phase  of  this  research,  interest  will  first  be  focused 
on  combining  the  desirable  features  of  exponential  tail  and  Weibull  tail 
procedures  to  obtain  a  two-step  procedure  which  (hopefully)  performs  well 
uniformly  over  all  the  situations  considered  in  Section  2.3.  The  two-step 
procedure  should  lead  to  nearly  unbiased  estimators  in  these  situations. 

If  so,  an  effort  will  be  made  to  attach  approximately  valid  confidence 
intervals  to  these  estimators,  such  confidence  intervals  being  very  desirable 
in  applications;  for  this, the  theory  in  Section  6  should  be  helpful. 

The  two-step  procedure  is  scale  and  power  invariant.  However,  in 
some  situations,  for  example,  when  the  underlying  distribution  is  believed 
to  be  nearly  normal  or  when  a  logarithmic  transformation  is  first  applied 
to  the  sample  data,  it  is  appropriate  to  use  procedures  which  are  location 
and  scale  invariant.  Modifications  of  the  two-step  procedure  which  possess 
these  two  invariance  properties  will  be  studied. 

The  numerical  results  presented  in  Tables  2  and  3  of  Section  2.3 
shown  that  the  efficiency  of  estimators  of  tail  probabilities  is  surprisingly 
sensitive  to  the  choice  of  measure  used  in  evaluating  this  efficiency.  So 
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far  two  measures,  E(S(x)  -  S(x)2  and  E^log 
It  is  desirable  also  to  include  a  measure  which  is  directly  related  to 
the  problem  of  estimating  extreme  value  distributions.  Recall  that  the 
distribution  function  for  the  maximum  of  a  random  sample  of  size  n  from 
the  underlying  distribution  is  of  the  form  (1-S(x))n,  x>0,  which  can  be 

^  n  *  n 

estimated  by  (l-S(x))  ,  x>0.  This  leads  to  the  measure  E((l-S(x))  - 

n  2 

(l-S(x))  )  ,  which  will  be  included  in  the  next  Monte  Carlo  experiment. 

(The  new  measure  should  be  closer  to  E^log  than  to  E(S(x)-S(x) )2. ) 

Finally,  efforts  will  be  made  to  analyze  appropriately  modified 
versions  of  the  best  procedures  when  the  sample  data  is  grouped,  dependent, 
or  nonstationary. 


have  been  employed. 
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3.  MONTE  CARLO  EXPERIMENT 

The  Monte  Carlo  experiment  and  its  results  were  discussed  in  Section 
2.3.  Further  details  are  given  in  Sections  3.1  and  3.2  and  some  additional 
summary  statistics  are  presented  in  Section  3.3. 

3.1  GENERATION  OF  PSEUDO-SAMPLE  DATA 

As  indicated  in  Section  3.2,  for  l<i<4  a  pseudo-random  sample 

X^>...>Xn  from  F^3  was  generated.  Then  for  1  <j <5 ,  xj^>...>xy^  was  used 

as  a  pseudo-random  sample  from  F... 

•  J 

Let  (U^,...,U  )  and (V^ ,. . . ,Vn)  denote  disjoint  sequences  of' 
consecutive  uniform  pseudo-random  numbers  generated  according  to  the 
algorithm  described  in  Lewis,  Goodman  and  Miller  [9].  Disjoint  such 
sequences  were  generated  for  each  pair  (i,n)  and  each  replication. 

(Actually  only  the  U's  were  generated  for  i=l  and  i=3.) 

The  random  sample  corresponding  to  the  standard  exponential  distri¬ 
bution  function  F^3  was  obtained  by  sorting  the  values  -log(l-Um), 
l£m<n.  The  random  sample  corresponding  to  the  exponential  mixture  F^  was 
obtained  by  sorting  the  values 

(f  I(Vm}  “  1og  (1-Um}’ 

where  I(Vm)=0  or  1  according  as  Vm  <.5  or  V„  >  .5. 
m  3  m  m  — 

The  random  sample  corresponding  to  the  lognormal  distribution  function 


F^^  was  obtained  by  sorting  the  values 
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exp(y3  +  a3  4>  1  (Um)),  1<jn<n, 

where  y3  and  a3  are  defined  in  Section  2.3.  The  inverse  of  the  standard 
normal  distribution  function  was  calculated  according  to  (26.2.23)  of 
Abramowitz  and  Stegum  [1].  The  random  sample  corresponding  to  the  lognormal 
mixture  F^3  was  obtained  by  sorting  the  values 

I(Vm)  -  |)  exp  (u3  +a3$_1  (Uj),  l<m<_n. 

3.2  STANDARD  ERRORS  OF  MONTE  CARLO  ESTIMATES 

For  each  estimator  T  of  a  quantity  t,  Monte  Carlo  estimates  of  the 

bias,  standard  deviation  and  root  mean  square  error  were  obtained  based 

on  the  indicated  number  N  of  replications  (e.g.,  N=600  for  n=100).  The 

results,  while  too  extensive  to  report  here,  certainly  helped  lead  to 

the  conclusions  in  Sections  2.3  and  2.4.  Formulas  for  approximating  the 

standard  errors  of  these  Monte  Carlo  estimates  will  now  be  described. 

Let  y  and  a  denote  respectively  the  mean  and  standard  deviation  of 

T  and  set  v2=ET2=y2  +  a2 »  v4=ET4  and  y4=E(T-y)4.  Note  that  Var(T2)= 

2  2  4 

-  v2  and  Var((T-y)  )=y^  -  a  .  Let  T^,...,TN  be  a  random  sample  of  size 
N  from  the  distribution  of  T.  Then  the  quantities  just  defined  can  be 
estimated  by 
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J  =  T  =  Tl  +  •"  +  Tn 


N 


"  4?  <Ti  - 


T)‘) 


2x1/2 


.lrT2/2,;2 

“  N?Ti  -  y  +  a  * 


—  It4 

NVi  * 


and 


“4  -  H-S  <T1  -T> 


N 


=rx2 


The  biasy-t  of  T  is  estimated  by 

Bias  (T)=  y  -  t  =  T  -  t  . 

The  standard  error  of  this  estimate  is  a//N.  This  leads  to  the  approximation 

A 

SE  (Bias  (T))  »  -2-  . 

✓N 

The  standard  deviation  a  of  T  is  estimated  by 

A  A 

SD  (T)  =  a 


Now 


N 


4T(?  -  a2)  -  i  ?  ((T1  -  “>2  '  <,2)  -  ^  <*<T  '  “»2  • 


/N  1 
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p 

so  by  the  central  limit  theorem  the  distribution  of  /N  (a  -  a )  converges 

2 

to  a  normal  distribution  with  mean  zero  and  variance  Var((T  -  y)  )  = 

2 

y4  -  o  .  Consequently  (by  (6a. 2.1)  of  Rao  [13])  the  distribution  of 

A 

•/FT  (a  -  a)  converges  to  a  normal  distribution  with  mean  zero  and  variance 
•  4\  2 

x.  -  a  )/4 a  .  This  leads  to  the  approximation 


SE(SD(T) 1  = 


2  2  1/2 

The  root  mean  square  error  ((y-t)  +  a  )  of  T  is  estimated  by 


A 

RMSE(T)  = 


An  argument  similar  to  that  used  in  the  previous  paragraph  leads  to  the 
approximation 
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3.3  SUMMARY  STATISTICS 

The  summary  statistics  described  briefly  in  Section  2.3  are  given 
explicitly  by 


/,  4  5  -  o\l/2 

R(S(*p»  ■  (s>  I,  l  E1j<S<Xp)  •  S(xp))2j  , 

/,  4  5  -  p  \  1/2 

US(xp))  ■  (sj  .f,  .1  Eij('°g  s<xp>  -  '09  S(xp))2  j  , 


R(*p)  - 


5 

I 


Ei.i(VxP)‘ 


>1/2 


1*'  J=1  Eij(xr(pn)  -  V 


Here  E.  .  denotes  expectation  when  F. .  is  the  underlying  distribution  function. 
*  J  1 J 

In  the  numerical  results  which  follow,  the  indicated  expectations  are  of 
course  replaced  by  their  Monte  Carlo  estimates. 

The  results  for  sample  sizes  n=100,  200,  400  and  800  are  given  in 
Tables  4,  5,  6  and  7  respectively.  In  each  table  p  takes  on  the  indicated 
values  1/50,...,  1/1600.  The  abbreviations  used  for  the  various  estimators 
are:  ET  (exponential  tail),  WT(Weibull  tail),  WML  (Weibull  maximum  like¬ 
lihood),  WP  (Weibull  percentile),  LML  (lognormal  maximum  likelihood), 

LP  (lognormal  percentile),  and  EM  (empirical). 
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Table  4.  Summary  Statistics  for  n=100 


1 

1 

1 

1 

1 

1 

Measure 

Estimator 

1600 

800 

400 

200 

100 

50 

R  ( S  ( x  )  ) 

ET 

5  % 

3.24 

2.43 

1.82 

1.36 

1.02 

0.73 

r 

ET 

102 

2.  6  1 

2.00 

1.5  7 

1.24 

0.9  7 

0.72 

ET 

25% 

2.64 

1.85 

1.36 

1.05 

0.85 

0-69 

WT 

25% 

2.38 

1.69 

1.25 

0.96 

0.76 

0.60 

WT 

50% 

2.45 

1.  75 

1.28 

0.96 

0.7  3 

0.5  5 

WML 

0.98 

0.  90 

0.83 

0.75 

0.6  7 

0.58 

WP 

10% 

2.49 

1.79 

1.32 

1.02 

0.  8  1 

0.68 

WP 

25% 

3.02 

2.  13 

1.54 

1.13 

0.85 

0.64 

LML 

22.49 

13.06 

7.60 

4.4  1 

2.52 

1.40 

LP 

10% 

7.24 

4  55 

2.86 

1.80 

1.14 

0.72 

LP 

25% 

6.8  1 

4.27 

2.69 

1.70 

1.07 

0.68 

EM 

4.13 

3.02 

2.  0  1 

1.4  0 

0.99 

0.  70 

L(S(x  )) 

ET 

5% 

7.7  1 

5.52 

3.89 

2.66 

1.75 

1.0  7 

? 

ET 

10% 

6.92 

4.8  3 

3.30 

2.20 

1.44 

0.  9  1 

ET 

25% 

9.24 

6.34 

4-2  1 

2  69 

1.64 

0.96 

WT 

25% 

26.08 

15.52 

9.07 

5.  14 

2.78 

1.40 

WT 

50% 

5.75 

4.28 

3.11 

2.19 

1.48 

0.95 

WML 

4.74 

3.73 

2.89 

2.  1  9 

1.6  1 

1.15 

WP 

10% 

15.8  1 

11.37 

7.97 

5.4  1 

3.56 

2.  3  1 

WP 

25% 

4.55 

3.45 

2.5  7 

1.87 

1.32 

0.  9  1 

LML 

2.48 

2.10 

1.74 

1.40 

1.07 

0.7  7 

LP 

10% 

2.62 

2.23 

1.84 

1.48 

1.13 

0.  8  1 

LP 

25% 

1.  36 

1.5  7 

1.30 

1.05 

0.82 

0.63 

R(x  ) 

ET 

5%- 

0.84 

0-90 

0-94 

0.84 

0-58 

0.8  3 

P 

ET 

10% 

0.75 

0.79 

0.80 

0.73 

0.5  2 

0.74 

ET 

25% 

0.8  1 

0.82 

0.80 

0.69 

0.48 

0.63 

WT 

25% 

1.59 

1.25 

1.02 

0.73 

0.49 

0.63 

WT 

50% 

1.17 

1.07 

0.95 

0.76 

0.49 

0.6  3 

WML 

0.77 

u  •  8  1 

0.82 

0.74 

0.5  3 

0.6  9 

WP 

10% 

0. 94 

0.99 

1-0  1 

0.93 

0.70 

1.0  7 

WP 

25% 

0.94 

0.95 

0.94 

0.8  1 

0.5  6 

0.77 

LML 

23.78 

17.20 

11.91 

7.10 

3.45 

3.43 

LP 

10% 

9.29 

5.96 

3.55 

1.99 

1.  00 

1.09 

L? 

25% 

4.29 

3.42 

2.58 

1.7  1 

0.  9  1 

0.99 
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Table  5.  Summary  Statistics  for  n=200 


/ 


1 

1 

1 

1 

1 

1 

Measure 

Es  timator 

1600 

800 

400 

200 

100 

50 

R  (S  (x 

)  ) 

ET 

5  % 

1.97 

1.54 

1.2  1 

0.  94 

0.7  1 

0.50 

p 

ET 

10% 

1.67 

1.  30 

1.-0  5 

0.85 

0.68 

0.5  3 

ET 

25% 

2.  05 

1.46 

1.08 

0.83 

0.65 

0.  5  1 

WT 

25% 

1.63 

1.2  1 

0.92 

0.  7  1 

0.5  5 

0.42 

WT 

50% 

1.64 

1.22 

0.  92 

0.70 

0.5  3 

0-40 

WML 

0.  88 

0.  84 

0.79 

0.72 

0.64 

0.54 

WP 

10% 

1.56 

1.17 

0.90 

0.7  1 

0.5  7 

0.48 

WP 

25% 

1.  74 

1.31 

1.00 

0.  78 

0.60 

0.47 

LML 

21.90 

12.76 

7.44 

4.32 

2.47 

1.36 

LP 

10% 

4.03 

2.62 

1.7  1 

1  11 

0.72 

0.47 

LP 

25% 

4.54 

2.89 

1.  84 

1-17 

0.74 

0.47 

EM 

2.81 

1.99 

1.3  7 

1.0  1 

0.70 

0.48 

L(S  (x 

)  ) 

ET 

5% 

4.06 

2.  80 

1.  9  C 

1.26 

0.83 

0.  53 

P 

ET 

10% 

4.59 

3.08 

2.00 

1.27 

0-80 

0.52 

ET 

25% 

7.44 

5.01 

3.23 

1.98 

1.13 

0.62 

WT 

25% 

4.1  3 

2.96 

2.0  7 

1.38 

0.88 

0.54 

WT 

5  0% 

2.34 

1.81 

1.36 

0.99 

0.70 

0.48 

WML 

4.  50 

3.51 

2.69 

2.02 

1.47 

1.03 

WP 

10% 

3.08 

2.40 

1.85 

1. 40 

1.07 

0.84 

WP 

25% 

2.59 

1.99 

1.50 

1.10 

0.80 

0.58 

LML 

2.47 

2.09 

1.73 

1.38 

1.05 

0.75 

LP 

10% 

1.57 

1.32 

1.08 

0.86 

0.65 

0.48 

LP 

25% 

1.46 

1.  20 

0.97 

0.7  5 

0.5  7 

0.43 

R  ( x  ) 

ET 

5% 

0.76 

0.  79 

0.72 

0.52 

0.7  3 

0.96 

P 

ET 

10% 

0.73 

0.73 

0.65 

0.46 

0.63 

0.86 

ET 

25% 

0. 90 

0. 89 

0.78 

0.52 

0.63 

0-72 

WT 

2  5% 

1.02 

0.  9  1 

0.74 

0.49 

0.  6  1 

0.  7  1 

WT 

50% 

0.85 

0.82 

0.69 

0.47 

0.60 

0.  7  1 

WML 

0.  91 

0.  95 

0.87 

0.63 

0.85 

1.02 

WP 

10% 

0. 80 

0.82 

0.7  5 

0.54 

0.78 

1.08 

WP 

25% 

0.79 

0.80 

0.7  1 

0.5  1 

0.63 

0.86 

LML 

23.35 

17.35 

11.13 

5.74 

5.62 

4.7  7 

LP 

10% 

2.82 

2-17 

1.5  1 

0.8  7 

1. 00 

1.0  1 

LP 

25% 

2.85 

2.38 

1.71 

0-9  7 

1.03 

1.02 
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Table  6*  Summary  Statistics  for  n=400 


1 

1 

1 

1 

1 

1 

Measure 

Es  timator 

I  600 

800 

400 

200 

100 

50 

R (S (x  )  ) 

ET 

5% 

1.31 

1  06 

0.87 

0.70 

0.53 

0.37 

P 

ET 

10% 

1.27 

0.98 

0.78 

0.63 

0.  5  1 

0.  40 

ET 

25% 

'  1.80 

1.29 

0.95 

0.72 

0.53 

0.38 

WT 

25% 

1.22 

0.93 

0.72 

0.55 

0.  4  1 

0.  29 

WT 

50% 

1.22 

0.93 

0.  7  1 

0.54 

0.40 

0.29 

WML 

0.86 

0.83 

0.78 

0.71. 

0.63 

0.53 

WP 

10% 

1.12 

0.88 

0.70 

0.56 

0.45 

0.  38 

WP 

25% 

1.16 

0.90 

0.  7  1 

0.56 

0.44 

0.35 

LML 

21.50 

12.56 

7.34 

4.26 

2.44 

1.34 

LP 

10% 

2.64 

1.75 

1.16 

0.76 

0.50 

0.33 

LP 

25% 

3.57 

2.25 

1.41 

0.87 

0.53 

0.32 

EM 

•  2.07 

1.47 

1.03 

0.73 

0.  5  1 

0.35 

L  ( S  ( x  )  ) 

ET 

5% 

2.  99 

1.  98 

1.29 

0.83 

0.54 

0.35 

P 

ET 

10% 

3.83 

2.49 

1.54 

0.92 

0.55 

0.36 

ET 

25% 

6.85 

4.56 

2.89 

1.7  1 

0.  9  1 

0.45 

WT 

25% 

2.13 

1.58 

1.  1  3 

0.78 

0.52 

0.33 

WT 

50% 

1.51 

1.16 

0.88 

0.64 

0.45 

0.  3  1 

WML 

4.48 

3.48 

2.66 

1.98 

1.43 

0.98 

WP 

10% 

2.  12 

1.  65 

1.26 

0.96 

0.72 

0.5  6 

WP 

25% 

2.08 

1.5  7 

1.15 

0.83 

0.59 

0.42 

LML 

2.46 

2.09 

1.73 

1.38 

1.05 

0.74 

LP 

10% 

1.21 

1.00 

0.  8  1 

0.63 

0.48 

0.35 

LP 

25% 

1.27 

1.02 

0.79 

0.59 

0.42 

0.30 

R(i  ) 

ET 

5% 

0.75 

0.68 

0.48 

0.64 

0.  8  1 

1.09 

P 

ET 

10% 

0.79 

0.70 

0.48 

0.59 

0.70 

0-98 

ET 

25% 

1. 08 

0.98 

0.67 

0.77 

0.80 

0.82 

WT 

25% 

0. 86 

0.72 

0.48 

0.  6  1 

0-68 

0.78 

WT 

50% 

0.77 

0.69 

0.47 

0.5  9 

0.6  7 

0.79 

WML 

1.13 

1.09 

0.  8  1 

1.03 

1.26 

1.45 

WP 

10% 

0.  81 

0.73 

0.  5  1 

0.70 

0.87 

1.22 

WP 

25% 

0.  78 

0.69 

0.48 

0.63 

0.76 

0.9  7 

LML 

24.  2  7 

16.41 

8.67 

8.75 

8.29 

6.89 

LP 

10% 

1.86 

1.44 

0.86 

0.98 

1.00 

1.05 

LP 

25% 

2.87 

2.24 

1.28 

1. 24 

1.16 

1.06 
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Table  7.  Summary  Statistics  for  n=*800 


1 

1 

1 

1 

1 

1 

Measure 

Es  timator 

1600 

800 

400 

200 

100 

50 

R  ( S  ( x  )  ) 

ET 

5% 

0.92 

0.  75 

0.62 

0.  5  1 

0.40 

0.  2  8 

V 

ET 

10% 

1.02 

0.80 

0.62 

0.48 

0.38 

0.32 

ET 

25% 

1.67 

1.20 

0.89 

0.67 

0.47 

0.30 

WT 

25% 

0.  90 

0.  70 

0.54 

0.40 

0.29 

0.  2  1 

WT 

50% 

0. 95 

0.  7.2 

0.54 

0.40 

0.29 

0.  2  1 

WML 

0.85 

0.82 

0.78 

0.7  1 

0.62 

0.52 

WP 

10% 

0.82 

0.67 

0.55 

0.44 

0.36 

0.29 

WP 

2  5% 

0.83 

0.68 

0.55 

0.43 

0.34 

0.2  7 

LML 

21.27 

12.45 

7.29 

4.24 

2.43 

1.33 

LP 

10% 

1.71 

1.  14 

0.  75 

0.50 

0.33 

0.  23 

LP 

25% 

2.99 

1.87 

1.15 

0.69 

0.39 

0.23 

EM 

1.48 

1.04 

0.72 

0.49 

0.35 

0.24 

L(S  ( x  )  ) 

ET 

5% 

2.  19 

1.37 

0.84 

0.52 

0.36 

0.25 

F 

ET 

10% 

3.27 

2.06 

1.22 

0.67 

0.38 

0.27 

ET 

25% 

6.36 

4.  1  9 

2.62 

1.51 

0.76 

0.33 

WT 

25% 

1.10 

0.84* 

0.63 

0.45 

0.  3  1 

0.  2  1 

WT 

50% 

1.02 

0.78 

0.58 

0.42 

0.30 

0-  2  1 

WML 

4.42 

3.43 

2.  6  1 

1.  94 

1.39 

0.95 

WP 

10% 

1. 48 

1.14 

0.86 

0.65 

0.49 

0.  38 

WP 

25% 

1.  78 

1.32 

0.95 

'0-6  6 

0.45 

0-32 

LML 

2.47 

2.09 

1.73 

1.38 

1.05 

0.74 

LP 

10% 

0.89 

0.  7  1 

0.56 

0.43 

0.32 

0.24 

LP 

25% 

1.18 

0.93 

0.70 

0.50 

0.33 

0.22 

R  (xp) 

ET 

5% 

0.6  1 

0.41 

0.55 

0.68 

0.95 

1.19 

ET 

10% 

0.72 

0.47 

0.60 

0.6  7 

0.  8  1 

1.05 

ET 

25% 

1.12 

0.77 

0.98 

1.07 

1.09 

0.92 

WT 

25% 

0.  6  1 

0.42 

0.5  7 

0.67 

0.  78 

0.80 

WT 

50% 

0.62 

0.42 

0.56 

0.65 

0.78 

0.82 

WML 

1.17 

0.86 

1.22 

1.5  1 

1.84 

2.15 

WP 

10% 

0.66 

0.46 

0.65 

0.80 

1.0  1 

1.2  7 

WP 

25% 

0.67 

0.45 

0.6  1 

0.73 

0.  90 

1.09 

LML 

22.13 

12.00 

13.  88 

13-9  1 

13-24 

10.5  1 

LP 

10% 

1.25 

0.76 

0.  9  1 

0.95 

1.00 

0.98 

LP 

25% 

2.38 

1.41 

1.59 

1.5  1 

1.34 

1.03 

42 


The  results  for  R(x  )  again  demonstrate  that  exponential  tail  estimators 
do  suprisingly  well  from  an  overall  viewpoint  in  estimating  the  p  quantile 
xp  for  small  values  of  p.  Wei  bull  tail  estimators  do  reasonably  well  when 

A  /V  A 

the  three  criterion  R(S(x  )),  L(S(x  ))  and  R(x  )  are  considered  together. 

r  r  r 

The  two-step  procedure  described  in  Section  2.3  will  hopefully  do  sub¬ 
stantially  better. 

Recall  that  T(pn)  is  the  smallest  integer  which  is  greater  than  or 
equal  to  pn,  so  that  X|-(pn)  is  an  emPlr1ca^  estimator  of  xp.  When  is 
the  underlying  distribution  function,  the  relative  mean  square  error  of 
this  estimator  is  given  by 


Monte  Carlo  estimates  of  the  quantity 


5 

I 


i=l  j=l 


E. 


1J 


as  a  function  of  n  and  p  are  shown  in  Table  8.  It  is  interesting  to  note 
that  for  each  value  of  n,  this  quantity  is  largest  when  p=l/n. 
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TabLe  8 . 

Summary 

Statistics 

for 

xr(pn)-*0j2 

1 

1 

1 

1 

1 

1 

n 

1600 

800 

400 

200 

100 

50 

100 

0.46 

0.43 

0.44 

0.58 

0  90 

0.48 

200 

0.40 

0.42 

0.53 

0.8  1 

0.39 

0-25 

400 

0.37 

0.45 

0.67 

0.34 

0.24 

0.  15 

800 

0.41 

0.  60 

0.32 

0.21 

0.  1  4 

0.  1  1 

44 


4.  TRANSFORMED  EXPONENTIAL  TAIL  ESTIMATORS 


In  Section  4.1  transformed  exponential  estimators  will  be  defined 
in  general  and  Theorem  4.1  will  be  obtained,  which  can  be  used  to  verify 
the  existence  and  uniqueness  of  such  estimators.  In  Section  4.2  this 
theorem  will  be  applied  to  verify  the  existence  and  uniqueness  of  Wei  bull 
tail  estimators.  In  Section  4.3  it  will  be  applied  to  several  other  spe¬ 
cific  transformed  exponential  tail  estimators. 


4.1  GENERAL  FORM 

Let  Xg  be  a  positive  number  and  let  Ty  (x),  x  >_  Xg,  be  a  continuous 

strictly  increasing  function  of  x  such  that  T  (xn)  =  0  and  lim  T  (x)= 
_1  x0  u  x^°°  x0 
Let  T  (y),  y  0,  denote  the  corresponding  inverse  function.  Two  examples 
x0 

of  such  functions  T  are  given  by  T  (x)  =  x  -  xn  and  T  (x)  =  loo(x/xn). 

x0  u  x0 

In  order  to  motivate  the  form  of  the  estimators  that  will  be  consi¬ 
dered,  suppose  first  that  the  conditional  distribution  of  T  (x)  given  that 

x0 

X  >  Xg  is  exponential  with  mean  a.  That  is,  suppose  that 

P(T  (X)  >  y | X  >  xn)  =  e"y/a,  y  >  0. 

x  g  u 

Then 


P(X>X|X>Xg) 


=  P(T  (X) 
x0 


>  T  (x)|X>xn) 
x0  u 


exp[-T  ( x ) / a] ,  x  >  xn, 

x0  u 


and  hence 


S(x)  =  S ( x g )  exp[-T  (x)/a],  x  >  xn 

U  Xg  u 


(4.1) 
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In  general,  let  S(xg)  be  an  estimate  of  S(xQ)  and  let  a  be  an  estimate 

A 

of  a.  Then  (4. 1 ) suggests  the  estimate  S(x)  of  S(x)  given  by 

S(x)  =  S(xQ)  exp[-T  (x)/a],  x  >  xQ.  (4.2) 

Associated  with  (4.2) is  the  corresponding  estimate  of  xp  given  by 

-1,-  S(x  ) 

Xp  =  T^(a  log  -^-) ,  0<P<S(xQ).  (4.3) 

Estimators  of  the  form  given  by  (4.2)  and  (4.3)  are  called  transformed 
exponential  tail  estimators. 

Let  the  values  obtained  in  a  random  sample  of  size  n  from  the 
distribution  of  X  be  written  in  decreasing  order  as  X^>  ...  >Xn>  The 
number  Xg  is  allowed  to  depend  on  this  data;  indeed  in  practice  Xg  =  X  +i 
for  some  integer  m  such  that  2<m<n.  The  tail  probabilities  S(x), 
x^Xg,  can  be  estimated  by 

S(x)  =  1  #{i:X.  >  x},  0  <  x  <  Xg.  (4.4) 

(Here  #1  denotes  the  number  of  points  in  the  set  I.)  In  particular 

S(x0)  =  1  #{i:X.  >  Xg}.  (4.5) 

Suppose  that 


m  =  #{i  :X..  >  Xg)  >  2. 


(4.6) 
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Maximum  likelihood  considerations  suggest  estimating  a  by 


(Xf). 

m  i  Xq  ' 


(4.7) 


It  remains  to  complete  (4.3)  by  defining  xp  for  p>S(xQ).  This  will  be 
done  by  setting 


*P  “  XT (np). 


S(xQ)  <  p  <  1  , 


(4.8) 


where  f(y)  denotes  the  smallest  integer  which  equals  or  exceeds  y. 


Given  x  satisfying  (4.6)  and  given  T  ,  formulas  (4.2)-(4.8)  determine 

0  Xq 

completely  the  estimators  of  the  tail  probabilities  S(x),  x>0,  and  quantiles 
x  ,  0<p_<l.  It  remains  to  determine  xn  and  T  . 

P  U  Xq 

If  X  has  an  exponential  distribution  with  mean  a,  the  conditional 

distribution  of  X  -  xQ  given  that  X >  xQ  (xQ  here  being  a  fixed  positive 

number)  is  exponential  with  mean  a.  Thus  a  natural  choice  of  T  is  given 

x0 

by  T  (x)  =  x  -  xn.  For  this  choice,  equations  (4.7),  (4.2)  and  (4.3) 

Xq  u 

become  respectively 


9  —  BXi  -  Xq), 


(4.9) 


S(x)  =  S(xq)  e 


(x-x0)/a 


x>x0, 


(4.10) 


A.  /v  S  (  Xq  ) 

xp  =  xo  +  a  l09  — 


0<  PIS(Xq) 


(4.11) 
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Estimators  of  the  form  given  by  (4.10)  and  (4.11)  are  called  exponential 
tai 1  estimators. 

If  log  X  has  an  exponential  distribution  with  mean  a  (so  that  X 

itself  has  a  Pareto  distribution),  the  conditional  distribution  of 

1 og ( X/Xq )  given  that  X >  xq  is  exponential  with  mean  a.  Thus  another 

natural  choice  of  T  is  given  by  T  (x)  =  log(x/xn).  For  this  choice, 
x0  x0 

equations  (4.7),  (4.2)  and  (4.3)  become  respectively 

A  -I  PI  X  • 

a  =  l£log-l  (4.12) 

m  1  xQ 

A 

S(x)  =  S(xQ)(^)1/a,  x>xQ,  (4.13) 

and 

A 

xp  =  x0|S(x0  ,  0<p<S(x0)  •  (4.14) 

Estimators  of  the  form  given  by  (4.13)  and  (4.14)  are  called  Pareto  Tail 

estimators.  Such  estimators  were  considered  by  Hill  [7], 

If  Xb  has  an  exponential  distribution  with  mean  a  for  some  b>0 

(so  that  X  itself  has  a  Weibull  distribution),  the  conditional  distribution  of 

Xb  -  xb  given  that  X >  Xg  is  exponential  with  mean  a.  Thus  a  third  natural 

choice  of  T  is  given  by  T  (x)  =  xb  -  x~.  This  choice  will  be  treated  in 
x0  x0  u 
Section  2.2. 

The  last  example  suggests  choosing  a  family  of  transformations 
T  (x,b),  x  >  xn,  where  b  ranges  over  some  interval  and  is  chosen  adaptively 

Xg  u 

from  the  sample  data.  This  approach,  in  comparison  to  that  of  choosing  a 


t 


fixed  transformation,  is  applicable  to  a  wider  range  of  the  unknown  underlying 
distribution,  but  it  requires  a  larger  sample  size  to  be  reliable. 
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In  implementing  this  approach,  there  is  a  variety  of  reasonable  ways 

A 

to  choose  a  value  b  for  b  based  on  the  data  ,  l<i<m,  where  m  is  given  as 

A  A 

before  by  (4.6).  Given  b,  it  is  natural  to  choose  a  as  in  (4.7),  so  that 

f 

a  =  l  XX  (X  ,  b).  (4.15) 

m  i  xo  1 

Hill  [4]  suggested  choosing  b  by  the  maximum  likelihood  method.  According 

A 

to  this  method  b  should  be  chosen  to  maximize 

nVVS) 

n  — -  exp[-T  (X  .,  b)/a] , 

la  x0  1 

A  A 

where  a  is  defined  in  terms  of  b  according  to  (4.15)  and  t  (x,b)  =  3T  (x,b)/3x. 

x0  0 

In  a  different  context,  Box  and  Cox  [2]  had  earlier  suggested  choosing  the 
parameters  of  a  transformation  by  the  maximum  likelihood  method.  But  there  is 
no  compelling  reason  for  this  choice  unless  the  corresponding  theoretical 
model  is  really  believed  to  be  valid. 

A 

Another  approach  to  choosing  b  is  to  make  some  feature  of  the  empirical 

A  /V 

distribution  of  T  (X.,  b)/a,  l<i<m,  match  up  with  the  correspondina  feature 
x0 

of  the  standard  exponential  distribution.  The  work  of  Pickands[l2]  suggests 
matching  up  some  fixed  quantile.  The  test  for  exponential ity  due  to  Shapiro 
and  Wilk  Q4]  (see  also  Stephens  [15])  suggests  making  the  empirical  second 

A  A 

moment  of  T  (X.,  b)/a,  l<i<m,  equal  to  the  second  moment  of  the  standard 
x0  1 

exponential  distribution,  namely  2.  According  to  this  criterion,  b  should 

A 

be  chosen  to  satisfy  the  equation  U(b)  =  2,  where 
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U(b)  = 


i  m  o 

-  tr  (x.,  b) 
m  1  x0  1 
1  m  « 

<i£yvb» 


This  is  the  criterion  which  will  be  studied  in  the  present  report. 

In  using  this  criterion  it  is  important  to  determine,  for  a  given 

A 

family  of  transformations,  when  a  solution  to  U(b)  =  2  exists  and  when  it 
is  unique.  That  this  determination  can  be  made  for  a  variety  of  natural 
families  of  transformations  is  one  of  the  main  justifications  for  the 
criterion  under  consideration. 

A 

In  analyzing  the  equation  U(b)  =  2  it  is  convenient  to  define  U  in  a 

more  general  manner.  To  thisend,  think  of  X.,  l<i<m,  as  fixed  numbers  and 

let  X  denote  a  random  variable  whose  distribution  is  the  same  as  the 

empirical  distribution  of  X.,  l<i<m.  Also  set  T(x,  b)  =  T  (x,  b), 

1  ~x0 

temporarily  ignoring  the  dependence  on  xQ,  and  set  T  =  T(X,  b).  Then 
U(b)  =  ET2/(ET)2. 

More  generally,  let  X  denote  an  arbitrary  ( i . e . ,  not  necessarily 

having  distribution  function  F)  nonconstant  random  variable  and  let  I  be 

an  interval  such  that  P(X€l)  =  1.  Let  J  be  an  open  interval  and  let  T(x,b) 

be  a  strictly  increasing,  strictly  positive  continuous  function  of  x  on 

I  x  J  such  that  3T(x,b)/3b  exists  and  is  continuous  in  x  on  I  x  J.  Set 

T  =  T(X,b)  and  T'  =  3T(X,b)/9b.  Suppose  that  T2,  T'  and  TT'  have  finite 

? 

expectations  and  that  dET/db  =  ET '  and  dET  /db  =  ETT'  (i.e.,  that  differen¬ 
tiation  and  expectation  can  be  interchanged  here).  Define  U  as  before  by 
U ( b )  =  ET  /(ET)  ,  b*0.  If  the  interval  J  is  not  compact,  but  U  is  monotonic 


on  this  interval,  let  it  be  defined  by  continuity  as  a  finite  or  an  infinite 
number  at  the  missing  finite  for  infinite  end  points  of  J. 

The  following  theorem  provides  a  sufficient  condition  for  there  to 

A  A 

be  at  most  one  point  b<J  such  that  U(b)  =  2. 


Theorem  4, 1 .  Suppose  the  above  conditions  hold  and  that  51og  T(x,b)/3b 
is  strictly  increasing  (decreasing)  in  x  on  I  x  J.  Then  U  is  continuous 
and  strictly  increasing  (decreasing)  and  has  strictly  positive  (negative) 
derivative  on  J. 

Proof.  Observe  that 


is  strictly  increasing  in  x  on  I  x  J.  (A  similar  argument  works  when  this 
function  is  strictly  decreasing.)  It  suffices  to  prove  that 

ET(T'  -  T  |J4)  >0  on  J. 

Choose  b€j.  Since  X  is  a  nonconstant  I-valued  random  variable,  there  is 

t 

an  XqCJ  such  that 

xtl  fl  (xQ,«), 


T(x,b)  >  |y-  T(x,b), 


and 

^  T(x,b)  <  |f-  T(x,b) ,  x«I  f|  (-°°,  xQ). 

Now 

E ( T ( T '  -  T  |f-);  X>x0)>T(x0,b)E(T'  -  T  |f-;  X>xQ) 
and 

E(T(T'  -  T  |f-);  X<xQ)  >T(x0,b)E(T'  -  T  |f-;  X<xQ). 

(Here  E(Z;  A)  =  E ( Z  1^),  where  equals  1  or  0  according  as  the  event  A  does 
or  does  not  occur.)  Note  that  strict  inequality  holds  in  the  first  (or  second) 
of  these  two  inequalities  unless  P(X>Xg)  =  0  (or  P(X  <  Xg)  =  0).  Since  X 
is  a  nonconstant  random  variable 

E(T(T  -  T*  |f-))  >  0 

as  desired. 

4.2  WE  I  BULL  TAIL  ESTIMATORS 

Theorem  4.1  will  now  be  applied  to  the  family  of  transformations  given 
by  T  (x,b)  =  xb-x~.  Here  X  is  an  arbitrary  random  variable  and  U  is 

Xg  u 

defined  in  terms  of  T  as  before. 

Theorem  4.2.  Let  0<Xg<°°  and  let  X  be  a  nonconstant  random  variable  such 
that  P ( X > x g)  =  1  and  bg  =  sup[b:EX^<°°]>0.  Set  T(x,b)  =  x^-Xg  for  x>Xg  and 
0<b<bg.  Then  U  is  continuous  and  strictly  increasing  on  (0,°°)  and 


52 


In  particular  there  is  at  most  one  positive  solution  b  to  the  equation 

A 

U(b)=2.  There  is  such  a  solution  if  and  only  if 
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(4.17) 


Suppose  (4.17)  holds.  Then  (4.15),  (4.2)  and  (4.3)  become  respectively 


-  -  -(xb-xb)/a 

S(x)  =  S(xQ)  e  0 


x>x 


O’ 


(4.18) 

(4.19) 


and 


XP  =  (x 


D  "  S(XC)) 

S +  a  log  -r- 


J/b 


0<P<S(Xg) 


(4.20) 


Estimators  of  the  form  given  by  (4.19)  and  (4.20)  are  called  Weibull  tail 

0 

estimators.  Some  of  their  asymptotic  properties  will  be  obtained  in 
Section  5. 

Proof  of  Theorem  4.2.  It  will  first  be  shown  that  slog  T(x,b)/sb 
is  strictly  increasing  in  x  for  x>xQ  and  b>0.  If  xQ=0,  then  Slog  T(x,b)/sb= 
log  x  and  hence  the  desired  result  is  valid. 

Suppose,  next  that  Xg>0.  Then 


^  log  T(x,b) 


b,  b  , 

x  log  x  -  Xg  log  Xg 


z  log  z  -  Zg  log  Zg 
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I 


(f  log  f 

1  /  0 _ fo_ 

-S--  1 


+  log  z. 


where  z  =  xb>Xg  =  Zg  for  x>Xg.  Thus  is  suffices  to  show  that  z  log  z/(z-l) 
is  strictly  increasing  in  z  for  z>l .  Now 


so  it  suffices  to  show  that 


d  /z  log  z\_  log  z  +  1 
dz  \  z  -  1  /  ~ 


T^T 


Z  loq  z 


( z- 1 ) ('log  z  +  1)  -  z  log  z>0  for  z>l 

or,  equivalently,  that  log  z  <  z-1  for  z>l .  The  last  inequality  is 
obviously  true  since  dlog  z/dz  =  z"^<l  for  z>l . 

Therefore,  in  general,  Tlog  T(x,b)/8b  is  strictly  increasing  in 
x  for  x>0  and  b>0.  Thus  by  Theorem  4.1,  II  is  continuous  and  strictly 
increasing  on  (0,bg). 

If  Xq=0,  then  lim^g  EX*^  lining  EX^  =  1  and  hence 


EX 


2b 


U(0)  =  1  im  - -V a  =  1 
b-K)  (ExV 


Consequently  (4.16)  holds  when  Xg=0. 

Suppose  instead  that  Xg>0.  Since 


U(b)  = 


m-’)‘ 


im 


'-TV 


P/- 


•V. 
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in  verifying  (4.16)  it  can  be  assumed  that  xQ=l  and  hence  that  P(X>1)=1. 
Observe  that  if  x>l ,  then 


and 


1  im 
t>+0 


log  x 


e-1 


for  0<b<e. 


Thus  by  the  dominated  convergence  theorem 


lim  F/xb-l\  _ 

b-0  .  \  b  / 


Elog  X 


and 


1  im  E 
b-0 


=  El ogc  X. 


Consequently  (4.16)  holds  when  Xg  =1. 

Suppose  bg=°°.  Suppose  also  that  (i)  there  is  an  M>xQ  such  that 
P ( X <M )  <  P(X<M)  =  1 .  Then 


lim  E 
b->-  * 


((£)b  -  (t)  )  *  "n„E((l)b;  X<M)  +  P(X=H> 


and  similarly 


lim  E 
b-»  “> 


=  P(X=M) 


((sf  -  G?)b) 


=  P(X=M) 
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Therefore 


U(°°)  =  lim 

My  -  tef 

f 

b*  00 

E//x\b  -/xo\  \ 

2 

((*)  ItJ  )J 

1 

P(X=M) 


Suppose  instead  that  (i)  fails  to  hold.  Choose  e>0  and  let  M>xQ  be  such 
that  0<P(X>M)  <_  e.  Then 


E(Xb  -  xb;  X>M) 
E(Xb  -  xb) 


and 


1  im 
b*  °° 


E ( ( Xb  -  xb)2;  X>M) 
E(Xb  -  xb)2) 


1. 


By  Schwarz's  inequality 


E((Xb 


X>M)  > 

> 


[E((Xb-xb);  X>M)]2 
P(X>M) 

[E((Xb-xb);  X>M)]2 
e 


Consequently 

U(°°) 


E((Xb-xb)2;  X>M) 

1  i  m  - r - r - x— 

b-  »  [E(Xb-Xg,  X>Mr 


Therefore  U(°°)  =  °°,  which  completes  the  proof  of  the  theorem. 
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4.3  OTHER  SPECIAL  CASES 


Theorem  4.1  will  now  be  applied  to  several  other  elementary  families 
of  transformations.  The  first  such  family  covers  situations  in  which  (4.17) 
fails  to  hold. 

Theorem  4.3  Let  Xq>0  and  let  X  be  a  noncohstant  random  variable 
such  that  P(X>Xq)=1  and  Elog2  X<°°.  Set  T(x,b)=log(l+b 


x-x, 


0 


-)  for  x>xn 
x0  u 


and  b>0.  Then  U  is  continuous  and  strictly  decreasing  on  (0,°°).  If 
EX2<°°,  then  U(0)=E(X-x0)2/(E(X-xQ) )2. 


Proof  Observe  that 


x-xr 


log  T(x,b)  = 


0 


x-xn  x-xn 

-V^)  log  (1+b  -r1) 


(1+b 


0 


'0 


Thus  to  prove  that  3log  T(x,b)/8b  is  strictly  decreasing  in  x  for  x>Xq, 
it  suffices  to  show  that 

(1+x)  log  (1+x) 
x 


is  strictly  increasing  in  x  for  x>0.  But  this  result  was  verified  in  the 
proof  of  Theorem  4.2.  The  conclusions  of  the  present  theorem  now  follow 
easily  from  Theorem  4.1  and  the  dominated  convergence  theorem. 
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Consider  the  family  of  transformations  given  by 

X-  X 

T  (x,b)  =  log(l+b  — — )  ,  x>x  , 
x0  x0  0 


where  b>0.  Then 


U(b) 


1  9  X  •  "X« 

1E,os2 

m  1  xQ 

i  X  •  “ X/-»  o 

log  (1+b  ^))2 


1 


'0 


By  Theorem  4.3,  U  is  continuous  and  strictly  decreasing  on  (0,«),  U(°°)=l 
and 


Thus  there  is  at  most  one  positive  solution  to  the  equation  U(b)=2,  and 
there  is  such  a  solution  if  and  only  if 


(4.21) 


Observe  that  the  left  side  of  (4.21)  is  larger  than  the  left  side  of  (4.17), 
so  that  (4.17)  and  (4.21)  cannot  simultaneously  fail.  Suppose  (4.21)  holds. 
Then  (4.15),  (4.2)  and  (4.3)  become  respectively 
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t  m  X.-x„ 

a  =  iElog  (1+b  -J-2), 


m 


1 


(4.22) 


'0 


and 


S(xn) 

S(x)  =  -  0 


^  X-X.  ! r  ’ 

(1+b  — V/a 

X0 


X±V 


(4.23) 


xp  =  x0  (  +  1-i),  0<p<S(x0)  •  (4. 


24) 


Estimators  of  the  form  given  by  (4.23)  and  (4.24)  are  called  Pareto  tail 

A 

estimators.  When  b=l  they  reduce  to  the  forms  given  by  (4.13)  and  (4.14). 
Suppose  (4.17)  and  (4.21)  both  hold;  i.e.,  that 


i  m  o  X  • 

iU>g2  ji 

m  xQ 


<2< 


i  m 

5^(*r 


x0) 


i£<VX( 


,))‘ 


(4.25) 


Then  there  is  a  Weibull  tail  estimator  for  a  unique  choice  of  b,  with 
'**  /\ 
0<b<l ;  and  there  is  a  Pareto  tail  estimator  for  a  unique  choice  of  b, 

A 

again  with  0<b<l.  When  (4.25)  holds  it  seems  reasonable  to  choose  the 

As 

method  which  corresponds  to  the  larger  value  of  b. 

There  are  some  other  elementary  families  of  transformations  for 
which  Theorem  4.1  is  applicable. 
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Theorem  4.4.  Let  c^l  be  a  positive  constant.  Let  x  >0  and  let  X  be 

c 

2c  2 

a  nonconstant  random  variable  such  that  P(X>xQ)  =  1 ,  EX  <°°  and  EX  <°°. 

Set  T(x,b)  =  (x+b)c  -  (xg+b)c  for  x>xg  and  b>Xg.  Then  U  is  continuous 
on  [-x0,«). 


and 


E(X-xq)2c 

U(-x0)  =  - j  » 

( E ( X-x  0) C ) 


U(°°) 


E(X-x0)2 
(E(X-Xg) )2 


Also  U  is  strictly  increasing  or  strictly  decreasing  on  [-Xg,°°)  according 
as  0<c<l  or  c>l . 

Proof.  It  is  clear  that  U  is  continuous  on  [-Xg,°°)  and  that  U(-Xg) 
has  the  indicated  value.  The  value  for  U(°°)  is  easily  verified  by  using 
the  dominated  convergence  theorem  and  the  formula 


Observe  that 


b((l+  f) 


-’)■/ 


x  c-1 

c(l+  f) 


dy. 


^  log  T(x,b) 


c 

b 


c-1 

-1 

(^+DC  -  1 


A  simple  computation  yields  that 
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d  /xC_1-l 


c-2 


dx»  xc-l  /  (xc-l)2 


(-*  +  cx  -  (c-1 ) ) 


It  follows  easily  from  this  the  formula 


^  (-xc  +  cx  -  (c-1))  =  c ( 1  -  xc_1) 
c- 1  c 

that  (x  - 1 ) / ( x  - 1 )  is  strictly  increasing  or  strictly  decreasing  on  [1 ,°°) 
according  as  0<c<l  or  c>l.  The  last  conclustion  of  the  Theorem  now  follows 
from  Theorem  2.1. 

Theorem  4.5.  Let  Xq>0  and  let  X  be  a  nonconstant  random  variable 
such  that  P(X>xQ)=l  and  EX2«».  Set  T(x,b)  =  log(x+b)~log(x0+b)  for  x>xQ 
and  b>-Xg.  Then  U  is  continuous  and  strictly  increasing  on  (-Xq,°°), 

U(-xQ)  =  1  and  U(«)  =  E(X-xQ)2/E(X-x0) )2. 

The  proof  of  this  theorem  is  similar  to  that  of  Theorem  4.3  and  left 
to  the  reader.  It  is  also  straightforward  and  left  to  the  reader  to  analyze 
the  transformed  exponential  tail  estimators  corresponding  to  the  transforma¬ 
tions  defined  in  the  statements  of  Theorems  4.4  and  4.5. 
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5.  ASYMPTOTIC  BEHAVIOR  OF  WEIBULL  TAIL  ESTIMATORS 

In  this  section  the  asymptotic  behavior  of  Weibull  tail  estimators 
will  be  studied  in  a  modified  setting  where  the  analysis  is  less  complicated. 
The  results  will  be  described  in  Section  5.1  and  proven  in  Section  5.2. 

5.1  DISCUSSION  OF  RESULTS 

Let  X  have  distribution  F  as  before.  Given  xQ>0,  let  Y  be  distri¬ 
buted  as  the  conditional  distribution  of  X-xQ  given  that  X>xQ.  Then 

S(x  +y) 

P ( Y >y )  =  P(X-xQ>y |X>xQ)  =  j  ,  y>0  . 
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Thus  the  equation  U(b)  =  2  has  at  most  one  positive  solution  b.  It  does 
have  such  a  solution  if  and  only  if 


(5.1) 


According  to  (4.19) 


■( (xn+Y)b  -  xb)/a 


S(xQ+y)  =  S(xq)  e 


,  y>0 


It  is  convenient  to  set 

-  S(xQ) 

s(y)  =  log  - - ,  y>0  , 

S(xQ+y) 


so  that 


s(y) 


(xo+y)b  -  xq 


a 


y>0  . 


A  problem  of  obvious  interest  is  to  determine  the  asymptotic  dis- 

A 

tribution  of  s(y)  as  an  estimator  of  log(S(xQ)/S(x0+y) )  as  m-*x>  and  x^ 
in  some  manner.  To  do  so,  it  is  necessary  to  determine  both  the  asymptotic 
variance  and  asymptotic  bias  of  s(y).  The  asymptotic  variance  of  s(y)  will 


r^iii  — 
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be  studied  here  as  *i-*»  for  fixed  xQ  and  then  Xq-x*  (The  asymptotic  bias 
appears  to  be  amenable  only  to  numerical  analysis  in  specific  cases.) 

As  m-w°  for  fixed  xQ,  the  random  quantities  U,  b,  a  and  s'  converge 
in  probability  to  certain  quantities  U,  b,  a  and  s,  which  will  now  be 
defined.  Set 


E((x0+Y)b  -  xg)2 
[E((x0+Y)b  -  xb)]2 


b>0  . 


Assume  that  all  moments  of  Y  are  finite.  Then  by  Theorem  4.2,  U  is  con¬ 
tinuous  and  strictly  increasing  on  (0,oo),  U(oo)=oo  and 


U(0)  = 


E  log 


E  log 


2  X0+Y 
XQ 

( VY  f 

1  xo  ' 


Thus  the  equation  U( b ) =2  has  at  most  one  positive  solution  bo  It  does 
have  such  a  solution  if  and  only  if 


When  (5.2)  holds,  set 


and 


?  Xn+Y 
E  log2  -f- 

XQ  9 
x7+YT7  <  * 

E  4r) 


a  -  E( (xQ+Y)b  -  xj) 


s(y)  = 


(x0+y)b  -  xj 


y>0 


(5.2) 
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Assume  that  (5.2)  holds  for  all  sufficiently  large  values  of  xQ  and 
also  for  the  specific  value  of  Xg  under  consideration.  Then  by  the  weak 
law  of  large  numbers,  (5.1)  holds  with  probability  approaching  one  as  m-«° 
for  fixed  xQ. 

A  A 

The  random  variables  b,  a  and  s(y)  converge  in  probability  to  b,  a 

and  s(y)  respectively  as  m-*°°  for  fixed  Xg.  Moreover  there  are  positive 

numbers  oM  cuandcu  (depending  on  xn)  such  that  as  m-*°  for  fixed  xn 
b  a  s(y)  0 

^  (vfii  (b-b))  N(0,cu)  , 

X(v¥  (a-a))  ->  N(O^j^) 

a 

and 

^  (v€(s(y)  -  s(y) ) )  N(0,cu  )  . 

s(y) 


Given  z>0,  let  y  be  defined  in  terms  of  z  land  Xg)  by 


s(y)  = 


(xn+y)b  -  xn 


Also  set 


Z  = 


(xo+Y) 


X 


b 

0 


and  note  that  EZ=1. 

The  asymptotic  behavior  of  o*  and  cu  as  xn-*»  will  now 

b  a  s(y(z) ) 


be  determined. 
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Theorem  5.1 .  Suppose  that 


-V  =  E(1  +  Y/x0)b  -  1  -  0  as  x0  -  -  . 

xo 

Suppose  also  that  as  Xq-w°  the  distribution  of  Z  converges  to  the  exponen¬ 
tial  distribution  with  mean  one  and  that  EZ4  converges  to  the  fourth 
moment  of  the  limiting  distribution  (which  equals  24).  Then  as  Xq-«° 


_ ] _ 

(a2  +  (b  xb  (log  xQ  +  1/b)  )2)1/2 


CU  ->  1 

a 


(5.4) 


a.  -  z2( (z/2  -  l)2  +  1)  .  (5.5) 

s (y(z)) 


By  itself,  Theorem  5,1  doesn't  provide  much  justification  for  using 
Weibull  tail  estimators.  In  order  to  obtain  some  such  justification,  sup¬ 
pose  that  the  Weibull  tail  model  is  exactly  valid  for  the  value  of  x^ 
under  consideration  (and  hence  for  all  larger  values  of  x^);  that  is, 
suppose  that 


P(Y>y)  =  exp 


-(f). 


y>  0 
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where  c  =  a1//b,  Then  above  assumptions,  including  those  of  Theorem  5,1, 
are  clearly  satisfied.  Also  s(y)  is  a  consistent  estimator  of 


and  as  m+oo  for  fixed  x. 


{4 


S(xJ 

sCyl  ■  109  sC^+yT 


SCx  ) 

S(y)  -  1og  Slx^+yT 


(o,cr£  ) 

V  s  ( Y ) 7 


Let  b,  c  and  T(y)  be  consistent  maximum  likelihood  estimators  of 
b,  c  and  s(y)  based  on  the  random  samp’e  Y^,...,Ym.  Then  there  are  posi¬ 
tive  numbers  a  ,  a  and  a  (depending  on  xn)  such  that  as  m-*x>  for 
b  c  s(y)  0 

fixed  x^ 


(b-b) ) 
(c-c)) 


and 


s(y)-s(y) ) ) 


Theorem  5.2.  Suppose  the  Wei  bull  tail  model  is  exactly  valid  for 
some  Xq.  Then  as  x^+oo 
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and 

a2_  -  z2((z/2  -  l)2  +  1)  .  (5.7) 

s(y(z) ) 

A  A 

Thus  b  and  s (y(z))  are  asymptotically  efficient  estimators  of  b  and  s(y(z)) 
respectively  as  first  m-*«>  and  then  x0-*». 


By  the  central  limit  theorem 
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l£url)-Op(1/J/2) 

H 2  +  iS  <Z1  -  «2> 

1  ' - 7^ - 77, 

i  (z,-n  +  onM4 

i  p\m/ 

=  (y2 +  lE  (zi  -^H1  -  iBV1)) +  Qp(wr) 

and  consequently 

0(b)  -  0(b)  -  lfu?-2b2  Z,V  *  0p(l)  . 

Therefore  by  the  central  limit  theorem 

JC  (vfii’(U(b)  -  U(b)))  N(0,y^-4y2U3  +  4y2 

A 

Now  b  is  a  consistent  estimator  of  b.  To  see  this,  let  0<b^<b<b2<  ®. 
Then  U ( b-j  )<U ( b )  =  2<U(b2)  according  to  Theorem  4.2.  By  the  weak  law  of 

A  /S 

large  numbers,  U(b-j)  -*■  U(b-j)  <  2  in  probability  and  U(b2)  -*■  U(b2)  >2  as 

/•*  /s 

n>*».  Since  U  is  monotonic,  the  probability  that  the  inequality  b^<b<b2 


and 


Thus 
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holds  converges  to  one  as  m-*-*  .  Thus  b  converges  in  probability  to  b  as 
m-**,  as  desired. 


By  the  mean  value  theorem  of  calculus 


U(b)  =  U(b)  +  U'(b0)(b-b)  =  2, 


where  bg  is  between  b  and  b.  Thus 


i .  b  -  “M-i 
u'(»o> 


Since  b  converges  in  probability  to  b  as  m-°°,  so  does  b  .  It  follows  by  a 

/\ 

direct  computation  that  U'(b)  converges  in  probability  to  U 1  ( b )  as  m-*-»  and 
that 


max  |U"(b~) f  =  0  (1)  for  0<b  <b<b0«»  . 
bi<b3<b2  3  "  12 


Consequently  U'  (bg)  converges  in  probability  to  U'(b)  as  nn<»  .  Therefore 


b_b  = "  +  °p(  jW 


and 


^  (/m~  (b-b))  -»■  N(0,aX  i  , 

b^ 


=  - - - ?  (m4  “ 4 y2y3  +  4y2  "  v\) 


b  (U'(b)r 


where 
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Define  functions  a(*)  and  a(*)  by 


and 


Then 


a(b)  =  E((xQ+Y)b  -  xb),  b>0. 


7  J'l  i_  L 

=  m  ^X0+Yi^  "  V’  b>0 


a(b)  -  a-J  L(Z.-l) 


By  the  mean  value  theorem 


^  A  A 


a  =  a(b)  =  a(b)  +  a'  (bQ)’(b-b) , 

A 

where  bg  is  again  between  b  and  b.  It  follows  as  in  the  previous  paragraph 

/A 

that  a'(bg)  converges  in  probability  to  a'(b)  as  m^°.  Thus 


a-a  =  a(b)  -a+a1 (b)(b-b)  +  0 p |  ]/ 2) 

=  i  +  (Zi  "  2y2  Zi+1)]  +  °p(n7T 

Therefore  by  the  central  limit  theorem 


£  (/rtf  (a-a) )  N (0 ,af  )  , 

a  i 


where 


al  =  a2(y2-l)  +  (^j)  (y4-4u2u3+4u^-yZ) 
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+  £i3_LLL  (u  -  2u;  +  u?). 
IT (b)  3  3  L 


It  now  also  follows  from  the  central  limit  theorem  that 


X  (/m"  (s(y)  -  s(y) )  -*-  N  l0,a 


where  a*,  .is  too  messy  to  justify  writing  down  here.  Observe  that  if 

s(y) 

y,  z  >  0  are  such  that 

,  x  (xo+y)b  ‘  xb 


(xQ+y)b  =  Xq  +  az 


(xQ+y)b  log(xQ+y)  -  xj  log  xQ 


az  log  xQ  +  1+  *§  I  log  /l+  ^ 

\  xQ/  '  xq 


In  particular 


(xQ+Y)b  log(xQ+Y)  -  xb  log  xQ 


-aZlo<,x0^(HS|)log(l*g 
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Since 


r  2 

log(l+t)  -  1  +  -^-  =  t  0(min(t,l)),  t>0. 


it  follows  from  the  hypotheses  of  Theorem  5.1  that  for  k=0,l,2  (as  >^-*=°) 


zkH,+S- 


aZ  .  1/aZ 

T  2  It, 
xo  V 


' 0 


Consequently 


a '  (b)  =  E((^+Y)blog(>^+Y)  -  xb  log  xQ) 


=  a(log  x0  +  ^  +  fr +  0  (fr) 
bx.  \bxn  / 


and 


E[Z((^+Y)blog(>^+Y)  -  xb  log  xQ)] 


■a“2(,O9X0^)+StO©- 

0  '  0 


It  now  follows  easily  that 


bx 


2a 


?  U'(b)  -  1 


and  hence  from  the  above  formula  for  o£  that  (5.3)  holds.  It  also  follows 

2 

from  the  above  formula  for  ac  that  (5.4)  holds. 

a 
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As  n>«°  for  fixed  x 


0 


s(y(z))  -  z 


bz  logx 

L  \  v  \  v 


/b-bx  ,a-a  .  n  (  1  \ 

<—>  -  2t  +  °ph;  • 


Let  (W^W^)  have  a  bivariate  normal  distribution  with  EW-j  =  EW2  =  0, 

3  2 


and 


Then 


where 


Var  W1  =  -  4p2u3  +  4u2  -  u2, 

Var  W2  =  p2-l , 


Cov(W15W2)  =  y3  -  2u2  +  u2< 
CTs(y(z) )  =  Var(f(z)Wi  -  zW2)» 


f(z)  = 


Now  (as  Xg-»°) 


f(z)  +  §  (f -1). 

It  follows  from  the  hypotheses  of  Theorem  5.1  that  Var  W-|->4,  Var  W2~>4  and 
cov(W1 ,W2)-^0.  Therefore  (5.5)  holds,  which  completes  the  proof  of 
Theorem  5.1. 

Suppose  the  Weibull  tail  model  is  exactly  valid  for  the  x^  under 
consideration.  Let  f  denote  the  density  of  Y  and  let 
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bb 

be 

w 

denote  the  Fisher  information  matrix,  defined  by 


3^1oq  f 


fdy  , 


-  f  cMog_f  fdy  f 
3b3c 

.jshi a_t  fdy . 

3c 


Set  Zg  =  (Xg/c)b  and  let  ( Zq )  denote  the  exponential  integral  given  by 


oo  -2 

El(z0)  =  /  V" 

Zn 


It  follows  by  straightforward  computations  that 


l  r«  ?  ?  ~(z-zn) 

rbb  =  ~7  ^  j  (z  log  z  '  z0  log  zo')  e  dz  » 

b  z0 

rbc  =  ‘  c(log  z0+  1  +  ^  El(z0)K  / 


I  -  b 
!cc  2  * 
c 


Let  I  denote  the  inverse  Fisher  information  matrix,  so  that 
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1  l^cc  *bc 

'bb'cc  -  <WZ  VIbc  Ibb^ 


‘bb'cc  -  'W 


D(Z0) 


where 


(z-zj 


D(zq)  =  1  +/  (z  log2z  -  zQlog2z0)  e  0  dz 


-  (log  zQ  +  1  +  e  0  E-|  (Zg) )2  . 


Let  b,  c  and  s(y)  be  consistent  maximum  likelihood  estimators  of 
b,  c  and  s(y)  respectively  based  on  the  random  sample  Y, Y  .  The 
asymptotic  joint  distribution  of  Vm(b-b)  and/m~(c-c)  is  a  bivariate 
normal  distribution  with  mean  0  and  covariance  matrix  I"\ 

Observe  that  (as  Xq-»°  or,  equivalently,  as  Zg-«°) 


e°  E,W4-Vo4 


and  hence 


=  ‘  r  l°9zn  +  ^+  ,  _  5  +  0 


•be  -  -  c  ,uy  £o T  1  T  i;  -  7T  u  7 

I-  0  zo  \zo 


By  straightforward  calculations 


i 
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I  =  — 
*bb  .2 


(log  z  0+  l)2  +  (log  zQ  +  1)^  -  +  \  +  0  (-^-j 


O'  ‘0 


Consequently 


<c2o)2(‘bb‘cc  -  Hk.)2)  -  1  • 


be' 


Since 


2  t"1 

^b  "  1  bb 


I 


cc 


.  2  .  2 
b  /c 


Vcc-tW2  IbbIcc-»bc)2 


(5.6)  is  valid. 

As  m-K»  for  fixed  x 


0 


where 


(/F  (s(y)  -  s(y) ) )  -*■  N(0,  a|^)  , 


2 

as(y) 


(*Q+y 


log 


x0+y 


f)  109  ~c 


i2 


bb 


'¥-)  i*  ^ 


+  ^s'Cy)!-1 
c 


cc 


1  x° 
°g  ~ 


S  (y)  I 


-1 

be 
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6.  ASYMPTOTIC  BEHAVIOR  OF  EXPONENTIAL  TAIL  ESTIMATORS 

Recall  that  X  is  a  positive  random  variable  having  distribution 

function  F  and  that  £(x)  =  -log  P(X>x)  =  -log  (l-F(x))  =  -log  S(x),  x>0. 

It  is  supposed  throughout  this  section  that  l  is  twice  differentiable  and 

2 

2,'>0  on  (0,°°).  Recall  also  that  r  =  £"/(£')  on  (0,“).  Results  concerning 
the  asymptotic  behavior  of  exponential  tail  estimators  of  a  quantile  xp^n 
are  stated  in  Section  6.1  and  proven  in  Section  6.2. 

6.1  DISCUSSION  OF  RESULTS 
Suppose  that 

1 im  r(x)  =  0  .  (6.1) 

x-k» 


Then 


lim  x£'  (x)  =  oo  ,  (6.2) 

X-*oo 


so 


lim  U(x+y/Z'  (x))  -  Jl(x)  =  y,  -«><y<°°  ,  (6.3) 

x-*» 

and  hence 


1  im 
x-*°° 


S(x+y/fc'(x)) 

S(x) 


,-y 


-oo<y<oo 


(6.4) 


80 


As  pointed  out  in  Section  2.3  in  connection  with  the  discussion  of  heavi¬ 
ness,  (6.1)  lipids  for  Wei  bull  and  lognormal  distributions,  but  the  con¬ 
vergence  is  very  slow.  Similarly  there  is  slow  convergence  in  (6.4)  or, 
equivalently,  in 


lim  p_1  S(x  +y/A'(x  ))  =  e'y,  -«xy<oo 

p-MPO  "  r 

Recall  that  =  Xi  =  max  (X^,...,Xn).  It  follows  from  (6.4)  that 
lfm  Fn(x,/n+y/Z'(x1/n)) 


=  lim  P ( £ 1 ( x 

n-H» 


1/n 


><VW 


y) 


-oo<y<oo 


(6.5) 


Proofs  that  (6.2)  -  (6.5)  follow  from  (6.1)  will  be  qiven  in  Section  6.2. 
The  classical  result  that  (6.1)  implies  (6.5)  is  due  to  von  Mises  [10]. 

Given  a  positive  integer  m,  let  Gm  denote  the  distribution  function 
given  by 

Gn/y)  =  E  kT  e  ky  e  6  »  -“*/«»  ; 

o 

its  density  is  given  by 


37  Vy> 


e"my 
(m-1 ) ! 


-oo<y<oo 
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I 


y 

(If  Y  has  distribution  function  Gm,  then  e  has  a  gamma  distribution  with 

parameters  m  and  1 . )  The  moments  of  Gm  are  all  finite.  Let  u  and  a2 

m  mm 

denote  respectively  the  mean  and  variance  of  G  .  Then 

m 


«,  ■  /  y  -y  e_e 


-y 


on 

dy  =  -j  log  z  e'z  dz  =  y  , 


where  y  «.5772  is  Euler's  constant.  Also 


2  -y  -e~y 

&  a 


I  y~  e  '  e 

-oo 


2  .  _-z  .2 


dy  =  |  log"  z  e  ‘  dz  =  j  +  y 


2  2 

so  o-|  =  it  /6.  It  is  easily  seen  that 


1 


J_  p-(m+l )y  e-e'y  me-m(z-y)  dy  =  ^  .  e-mz  g-e 


-z 


m! 


(m-1 ; ! 


so  that  G^  is  the  convolution  of  G^m+^with  an  exponential  distribution 
.  2 

having  mean  1/m  and  variance  1/m  .  Consequently  um  =  um+1  +  1/m  and 
2  2  2 

~m  =  am+l  +  ''//m  anc*  therefore 


rtJ  i  o  2  m~  1  ,  co 

y  -  ^  ^  and  am  =  -g-  -  t, 

Ik  m  k 


ular  the  variance  of  Gm  approaches  zero  as  m-*®. 

■  i ' 1  that  Xm  is  the  mth  largest  value  among  X^,...,X  (the 

'  n  or  n  is  suppressed).  The  next  result,  which  strengthens 
-  r),  will  be  proven  in  Section  6.2. 


r trz-rT* — r'~3 


tti  . 
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Theorem  6.1 .  Suppose  (6.1)  holds.  Then  as  n-**>  the  distribution 
function  of  V  (xi/n)  ^m”xl/n^  conver9es  to  anc'  the  moments  of  this 
random  variable  converge  to  the  corresponding  moments  of  G  . 

Given  a  positive  integer  m  and  a  positive  number  p  with  0<p<m+l , 
let  xp^'  denote  the  exponential  tail  estimator  of  the. quantile  xp^n  based 
on  a  random  sample  of  size  n  as  given  by  (2.3);  so  that 


it  ■  Vi  +  T 


(6.6) 


where 


g(m)  _  1  y  (x  _x  \ 
a  m  i  m+l' 


(6.7) 


The  asymptotic  behavior  of  xp^  as  n-*»  for  fixed  m  is  described  in  the 
next  result,  which  will  be  proven  in  Section  6.2. 

Theorem  6.2.  Suppose  (6.1)  holds  and  0<p<m+l .  Then  as  n-*°  the 
distribution  function  and  moments  of  £'  (xyn) ^p/n-xp/n^  conver9e  to  those 
of  a  random  variable  which  has  finite  moments  of  all  orders  and 
positive  variance;  moreover  E(Zpm^)^  -+•  0  as 

The  following  easily  obtained  consequence  of  Theorem  6.1  and 
Theorem  6.2  will  also  be  proven  in  Section  6.2. 

Theorem  6.3.  Suppose  (6.1)  holds  and  p>0.  If  (mn>  is  a  sequence 
of  positive  integers  which  tends  to  infinity  sufficiently  slowly  as 
n-«°,  then 
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1  im 

n-x» 


(m  ) 

~  n 

^xp/n  ~xp/n 

Var<;^> 


=  0, 


0<p<m+l 


and 


lim 

n-K» 


[>n} 

^P/n  ~Vn, 

Var  X 


m 


=  0 


in  particular 


1  im 

n-K» 


E 


(m  ) 

j  n 

Xp/n  ~xp/n 
Var  M 

n 


=  0 


Theorem  6.1  and  Theorem  6.3  together  yield  the  following  result, 

(m„) 

p/n 


which  shows  that  under  appropriate  conditions  x„,"  is  a  consistent 


estimator  of  x 


p/n* 

Theorem  6.4.  Suppose  that  p>0,  that  (6.1)  holds  and  that 


lim  £'(x)  =  oo  .  (6.8) 

X-H» 


If  {mn>  is  a  sequence  of  positive  integers  which  tends  to  infinity  suffi¬ 
ciently  slowly  as  n  ,  then 


1  im 
n-*-a> 


=  0 


Note  that  (6.8)  holds  for  such  light-tailed  distributions  as  Weibull 
distributions  with  shape  parameter  b>l.  But  it  fails  for  medium-tailed 
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and  heavy-tailed  distributions  in  which  r(x)  <  0  for  x  sufficiently  large; 
in  particular,  it  fails  for  exponential  distributions,  for  Weibull 
distributions  with  shape  parameter  b<l,  and  for  lognormal  distributions. 

6.2  PROOFS 

Suppose  from  now  on  that  (6.1)  holds.  It  will  first  be  shown  that 
(6.2)  holds.  Suppose  otherwise.  Then  there  is  a  positive  number  M  and 
there  is  a  sequence  {x^}  of  positive  numbers  tending  to  infinity  such 

that  Xj.rtXj)  <  M  for  j  >  1.  By  (6.1)  there  is  a  positive  number  xQ  such 
that 


_d_  1  ,  v  l 

dx  £' (x)  “  1  2M  ’  x  —  xc 


Choose  j  such  that  x^.  >  xQ.  By  the  mean  value  theorem 


1  „  xj-x  x.j 

<  -a..  <  ^4 


(Xj)  "  V  (x)  -  2M  -  2M  *  xo±-x-xj 


and  hence 


x  <x<x 


j 


Consequently  ttx^)  <  t(xQ)  +  2M  for  j  >  1,  which  contradicts  the  fact 
that  t(x)  = -1 og  P(X>x)-™  as  x-«°.  Therefore  (6.2)  holds  as  desired. 
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Next  (6.3)  will  be  verified.  By  the  mean-value  theorem  i 
to  show  that  for  M>0 


lim  -  ^  ^  =  1  uniformly  for  |y|  <  M 

x-xx)  ' x ' 

Choose  z  >  0.  By  (6.1),  (6.2)  and  the  mean-value  theorem  there  i 
xQ  >  0  such  that  if  x  >  xq  and  |y|  £  M,  then 


1 

£'  (x+y/£‘  (x)) 


1 

I'  (x) 


z  M 

vT7> 


and  hence 


I  A1 (x) 

|  £  ‘  ( x+y/ £ ' ( x ) ) 


<  eM 


Therefore  (6.9)  is  valid  and  hence  so  is  (6.3).  Now 

S'(X^sfxSX))  =  expC-(^(x+y/£'(x))-£(x)] 
so  (6.4)  follows  immediately  from  (6.3).  Also 


(xl/n+y/^xl/n^ 


S(x1/n+y/£'(x1/n)) 


s(xW 


so  (6.5)  follows  from  (6.4).  This  completes  the  proof  that  (6.2) 
follow  from  (6.1). 


suffices 


(6.9) 

s  an 


-  (6.5) 
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In  preparation  for  the  proof  of  Theorem  6.1,  two  further  elementary 
consequences  of  (6.1)  will  now  be  obtained.  Let  c>l.  Then 


1  im 
p-*o 


!c£1xc!p  .  1 
xp"xcp 


(6.10) 


and 


lim  ^(xp)(xp-xcp)  =  log  c 


p+o 


(6.11) 


To  prove  these  results,  choose  e  with  0<e<l.  By  (6.1)  there  i 


s  an 


xQ>0  such  that 


1  1 
£'(x+y)  "  VJx) 


icy  for  x  ^  xQ  and  y  >_  0 


In  particular 


1  1 
£' (x+y)  "  V  (x) 


-  TTxF  for  x  >  xQ  and  0  <  y  < 


Consequently 


1  (x+y)  <  Vt1  for  x  >  x  and  0  <  y  <  — 4 

1  b  i-e  —  o  — J  —  e£‘ 


e£ 1  ( x ) 
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Therefore 


l 


» 


i 

r 


S(X^'U))  •«P(«-Vrt-(»))  -  «*))  >  . 

Choose  c>l  and  let  e  be  small  enough  that  e^/£(^+£)  ^  c2.  Choose  pQ>0 

such  that  x  «  >  x  .  Then  for  p  >  p„ 

c2Po~  0  0 


£,(vP) 


2,1  (XC2P^ 

1-e 


for  x  o„  <  z  <  x„ 
c^P  -  P 


Also 


s<xc2p>  *<xcp>  -  t(xc2p> 
'  e 


Thus  loa  c  =  2,'(z)(x  -x  ?„)  for  some  z  between  x  o  and  x  „  and  hence 
-  cp  c^-p  c£p  cp 


(l-e)loq  c 


-  Xcp 


( 1+s) 1 00  C 
c2p  -  £'(x^2p) 


for  p  >  pQ 


Similarly 


(1-e) log  c 


< 


<  ( 1+e) 1 og 
cp  -  a1 (^l2p) 


c2r 


for  P  >  P0 


( 
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I 


and  consequently 


1-e  .  xcp  xc^p  1+e  £ 

__  <  — c. - £  <  __£l  for  p  >  n 

1+e  -  x  -x  -  1-e  K  -  Ko 
P  cp 


Since  e  can  be  made  arbitrarily  small,  (6.10)  is  valid.  It  is  now  clear 
that  (6.11)  is  also  valid. 

Theorem  6.1  will  now  be  verified.  First  the  well  known  result  that 
the  distribution  function  of  £' (x^n)(Xm-x.^n)  converges  to  Gm  will  be 
obtained.  To  this  end,  observe  that 


p(Xm£*)  =  £  00-F(x))k  Fn"k(x) 


and  hence 


PU'  (xi/nKVxi/n)  i  y) 

m-1  _-k,nYs(xl/n*y/*'(xl/n)\k  cn-k 


Nl“  I  “IW  II  \ 

■  L  "  (k> 


Since  limn  n  (£)  =  1 /k I ,  it  now  follows  from  (6.4)  and  (6.5)  that 


lim  p(r(xi/nHVxl/n 

IT*00 


m-1  .  .  -y 

)(Xm'xl/n)£y)  =  £  FTe  7  e"  =  Gm(y)’ 


-oo<y<oo 


as  desired. 


i 
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It  remains  to  prove  that  the  (positive  integer)  moments  of 
V  (xyn) (Xm"xi/n)  converge  to  those  of  Gm*  For  m=l  this  result  has  been 
obtained  in  a  more  general  context  by  Pickands  [11].  It  is  possible  to 
reduce  the  result  for  m>l  to  that  for  m=l.  But  since  Pickand's  proof  is 
rather  complicated,  a  self-contained  proof  for  m>1  will  be  given. 

Since  convergence  in  distribution  has  already  been  established,  it 
suffices  to  show  that  the  moments  of  it’  (x-j/n) ( xm-xi /n ^  are  bounded  in  n. 
For  this  it  suffices  to  show  that  for  each  positive  integer  j 

Tnl  *  i  x,/„-y)<iy 


and 

T„2  =  P0tm  >  x,/n+y)dy 

are  bounded  in  n.  To  verify  that  is  bounded  in  n,  it  is  enough  to 
show  that  for  0  <_  k  <_  m-1 

xl/n 

Tn3  =  (xl/n))J  /  (x1/n-y)J’"1(nS(y))k(l-S(y))n-kdy 

is  bounded  in  n. 

Choose  5  with  0  <  6  <  1/2.  Choose  i(n)  such  that  (for  n  sufficiently 

large) 

62n  <  2^n^  <  6n 
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Then 


Tn3  =  ^'^l/n 


'  V/n 


(xl/n-y; 


.  ,x21(n)/n 

+  j  ( *Vn-y)}-] 


Observe  that  if  x 2 i / n  f.  y  ±  X21-Vn’  t*ien 


(x1/n-y)j-1(nS(y))k(l-S(y))r 


-  (xl/n_X2i/n)J  ^nS(x2i/n))k 


,i -1 


(xl/n“x2l/n)j2lk(1-^n”j 

<-^vn-*zv^k-*-2"  ** 


Observe  also  that  if  0  <  y  x 2 i / n »  then 


(x1/n-y)J'-1(nS(y))k(l-S(y))r 


1  (’W" 


,n-k 


"k  ^<n>  ^ 


<  (x1/n)j-1  nk  e'6  n  2~k 


3_1(nS(y))k(l-S(y))n-kdy 


(nS(y) )k(1-S(y) )n_kdy 


-k 

l-S(x21..1/n))"-k 
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Thus  to  prove  that  is  bounded  in  n,  it  suffices  to  show  that 


Tn4  =  (r(xi/n))J  ^  (x2i-1/n"x2i/n^xl/n"x2i/n^  1  2  1  e  2 


i-1 


and 


Tn5  =  (xi/n^J  (xl/n^  1  n  e  6  n 


are  bounded  in  n. 

Choose  e  >  0.  By  (6.10)  and  (6.11)  if  6  is  made  sufficiently  small, 
then  (for  n  sufficiently  large) 


KKKn) 

x21-l/n"x21/n 


and 


5-,(x1/n)(xi/n-x2/n^  -  1 


Consequently 


hence 


x2l-l/n-x2Vn  -  tTxj^T  • 


xl/n-x2Vn^P(x,/n)I1TL  •  • 
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and  therefore 


Tn4±  £ 


i/n  2i  „ki  -21"1 


(1+e)  2  e' 


which  is  bounded  in  n.  Similarly  it  follows  from  (6.10)  and  (6.11)  that 


r(x1/n)  =  0((l+e)log2n)  =  0(nlog2(1+e) 


and  hence  that 

Tn5  =  0(nlog2(1+£)+k  e_<S^n)  . 

Thus  Tn5  is  also  bounded  in  n.  This  completes  the  proof  that  T^  is 
bounded  in  n. 

To  prove  that  J  ^  is  bounded  in  n,  it  suffices  to  verify  the 
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By  an  argument  similar  to  that  used  in  verifying  the  boundedness  in  n 

ft 

of  T^,  it  now  follows  that  is  bounded  in  n.  This  completes  the 
proof  of  Theorem  6.1. 

In  preparation  for  the  proof  of  Theorem  6.2,  another  elementary 
consequence  of  (6.1)  will  be  obtained: 


lim  ?''^xl/n^F"^1'n^xl/n^  =  log  z’  z>0 

n-K-n  '  ' 


(6.12) 


Choose  e  >  0.  By  (6.4) 


lim  n 
n-x» 


log(l/z)+e^ 
xl/n^  j 


e”ez 


and 


n~>  \  u/n 


=  eez 


so  for  n  sufficiently  large 


log  1=  o  <  i'(x,/n)(F-’(l-f)-x1/n)  <  log  It  c  . 

Since  e  can  be  made  arbitrarily  small,  (6.12)  holds  as  desired. 

Theorem  6.2  will  now  be  verified.  Let  W. be  independent 

I  m  i 

variables  such  that  W.  has  an  exponential  distribution  with  mean  1  for 
each  i  and 
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\ 


£*>■'£  (!) 


m-1 


‘m+1 


-kx, -i.  -x>n-k 
e  (f-e  )  ,  x>0 


let  W-j , . 
function 


.  ,Wm,  Z^-j  be  independent  and  suppose  that  has  distribution 
Gm+^ .  Then  Z^j  -  log  n  converges  in  distribution  to  1  ^  as 


n-*=°.  For  l<i<m  set 


7(0)  _  7(0) 

£i  m+1 


m  W. 


and 


Zi  " 


‘m+1 


m  W. 

E^- 

i 


j 


Then  z|n ^ , . . .  ,Z^j  are  well  known  to  be  jointly  distributed  as  the  m+1 
largest  values  in  a  random  sample  of  size  n  from  an  exponential  distribu¬ 
tion  with  mean  one;  and  as  n-*»  the  joint  distribution  of  zjn^  -  log  n,...,* 
Z^j  -  log  n  converges  to  that  of  Z^,...,Zm+^.  By  the  first  result  in  the 
last  sentence 


-Z 


1  -e 


(n) 

1 


. . ,1-e 


z(n) 

m+1 


are  jointly  distributed  as  the  m+1  largest  values  in  a  random  sample  of 
size  n  from  the  uniform  distribution  on  [0,1].  Consequently 
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are  jointly  distributed  as  the  m+1  largest  values  in  a  random  sample  of 
size  n  from  F;  that  is,  they  have  the  same  joint  distribution  as 

Xm+1 '  Therefore 


are  jointly  distributed  as 

^,(xl/n)(xrxi/n)»'--^l(xi/n)(xm+Txi/n) 

By  monotonicity  the  convergence  in  (6.12)  is  uniform  on  compacts.  This 
now  implies  that  the  joint  distribution  of 

5,1  ^xl/n^Xl‘xl/n^’*',,A'^xl/n^Xm+rxl/n^ 


converges  as  n-«°  to  that  of  Z-j .  .Z^ .  (For  more  general  results  to 
this  effect,  see  Weissman  [17],  [18].)  This  and  Theorem  6.1  together 
imply  that  as  n-*»  the  distribution  and  moments  of 


r  (xl/n^xp/n‘xl/n) 


£’(xl/n) 


Xm+l"xl/n  +  log  p  I  £  ^Xrxl/n^XrrH-rxl/n^ 
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converge  to  those  of 


z^i +  ?(zcW  *z, 


m+1 


log 


m+1 

P 


i$»i 


By  (6.11) 


^  l!'(xl/n><xl/n-Vn)  '  109  p  ' 

Therefore  as  r+x»  the  distribution  and  moments  of  (x-j/n)  ^p/n-xp/n)  converge 
to  those  of 


Vi  +  los  P  + log  fill  . 

This  random  variable  has  finite  moments  of  all  orders.  In  particular  its 
variance  is 


Var  Z 


m+1 


♦  s  I"2 


m+1  _  v1  i  j.  *  m+i 

T'5  3  1  ’  9  -T 


m+1  k 


m+1 

P 


which  is  positive  for  each  m  and  approaches  zero  as  m-K*>.  Also  its  mean  is 

m  , 

E  Zm+1  +  1  °g (nr+ 1 )  =  y  -  +  log(m+l) 

which  approaches  zero  as  m-*».  Therefore 

l™  E^,  +  log  p  *  log  Sill  =  0  , 

which  completes  the  proof  of  Theorem  6.2. 


Finally  Theorem  6.3  will  be  verified.  Let  {e  }  be 

m 

positive  numbers  which  approaches  zero  as  m+<».  Let  nm  be 
integers  such  th3t 


E«!'(>'1/n)(^-Vn»2iE'Zpn')>2  +  em' 


where  is  defined  in  Theorem  6.2.  Let  m  be  positive 
p  n 

to  »  as  n-*»  and  such  that  in  <  inf[m;n  >nl.  Then  n„  <  n 

n  m  J  rn  — 

n 


E(‘,(xVn)(vH>-x 


p/n)  -E(Zp  )  +  £r 


Consequently 


,  /  (mn)  \2 

U'(x1/n»  E|xp/n  -xp/nj  =  0 


The  conclusion  of  Theorem  6.3  now  follows  from  Theorem  6. 


a  sequence  of 
positive 

n— nm  » 

integers  tending 
and  hence 


and  Theorem  6.2. 
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